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The invention was made with government support under grant number CA64394 
awarded by the National Institutes of Health. The government has certain rights in the 



mvention. 



1. INTRODUCTION 

1 5 The present mvention relates to the use of alpha (2) macroglobulin ("a2M") receptor 

as a heat shock protein receptor, cells that express the a2M receptor boxmd to an HSP, and 
antibodies and other molecules that bind the a2M receptor-HSP complex. The invention 
also relates to screening assays to identify compounds that modulate the interaction of an 
HSP with the a2M receptor, and methods for using compositions comprising a2M-receptor 

20 sequences for the diagnosis and treatment of immune disorders, proliferative disorders, and 
infectious diseases. 



2. BACKGROUND OF THE INVENTION 

2. 1 . HEAT SHOCK PROTEINS 

Heat shock proteins (HSPs), also referred to as stress proteins, were first identified 
as proteins synthesized by cells in response to heat shock. Hsps have classified into five 
families, based on molecular weight, HsplOO, Hsp90, Hsp70, Hsp60, and smHsp. Many 
members of these families were found subsequently to be induced in response to other 
stressfiil stimuli including nutrient deprivation, metabolic disruption, oxygen radicals, and 
infection with intracellular pathogens (see Welch, May 1993, Scientific American 56-64; 
Young, 1990, Annu. Rev. Immunol. 8:401-420; Craig, 1993, Science 260:1902-1903; 
Getiiing et aL, 1992, Nature 355:33-45; and Lindquist et a/., 1988, Annu, Rev. Genetics 
22:631-677). 

Heat shock proteras are among the most highly conserved proteins in existence. For 
example, DnaK, the Hsp70 from E, coli has about 50% amino acid sequence identity with 
Hsp70 proteins from excoriates (Bardwell et aL, 1984, Proc. Natl. Acad. Sci. 81:848-852);' 
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The Hsp60 and Hsp90 families also show similarly high levels of intra-family conservation 
(Hickey et al, 1989, Mol. Cell. Biol. 9:2615-2626; Jindal, 1989, Mol. Cell. Biol. 9:2279- 
2283). In addition, it has been discovered that the Hsp60, Hsp70 and Hsp90 famihes are 
composed of proteins that are related to the stress proteins in sequence, for example, having 

5 greater than 35% amino acid identity, but whose expression levels are not altered by stress. 
Studies on the cellular response to heat shock and other physiological stresses 
revealed that the HSPs are iovolved not only in cellular protection against these adverse 
conditions, but also in essential biochemical and immunological processes in unstressed 
cells. HSPs accomplish different kinds of chaperoning functions. For example, members 

2Q of the Hsp70 family, located in the cell cytoplasm, nucleus, mitochondria, or endoplasmic 
reticulum (Lindquist et aL, 1988, Ann. Rev. Genetics 22:631-677), are involved in the 
presentation of antigens to the cells of the immune system, and are also involved in the 
transfer, folding and assembly of proteins in normal cells. HSPs are capable of binding 
proteins or peptides, and releasing the bound proteins or peptides in the presence of 

J 5 adenosine triphosphate (ATP) or low pH. 

2.2. IMMUNOGEMCITY OF HSP-PEPTIDE COMPLEXES 

Srivastava et al demonstrated immune response to methylcholanthrene-induced 
sarcomas of inbred mice (1988, Immtmol. Today 9:78-83). In these studies, it was found 

20 that the molecules responsible for tiie individually distinct immunogenicity of these tumors 
were glycoproteins of 96kDa (gp96) and intracellular proteins of 84 to 86kDa (Srivastava et 
al, 1986, Proc. Natl. Acad. Sci. USA 83:3407-341 1; UUrich et al, 1986, Proc. Natl. Acad. 
Sci. USA 83:3121-3125). Immunization of mice with gp96 or p84/86 isolated from a 
particular tumor rendered the mice immune to that particular tumor, but not to antigenically 

25 distinct tumors. Isolation and characterization of genes encoding gp96 and p84/86 revealed 
significant homology between them, and showed that gp96 and p84/86 were, respectively, 
the endoplasmic reticular and cytosolic coimterparts of the same heat shock proteins 
(Srivastava et al, 1988, Immunogenetics 28:205-207; Srivastava et al, 1991, Curr. Top. 
Microbiol. Immunol. 167:109-123). Further, Hsp70 was shown to eUcit immunity to the 

30 tumor from which it was isolated but not to antigenically distinct tumors. However, Hsp70 
depleted of peptides was found to lose its immunogenic activity (Udono and Srivastava, 
1993, J. Exp. Med. 178:1391-1396). These observations suggested that the heat shock 
proteins are not immimogenic per se, but fomi noncovalent complexes with antigenic 
peptides, and the complexes can elicit specific immunity to the antigenic peptides 



35 (Srivastava, 1993, Adv. Cancer Res. 62:153-177; Udono et al, 1994, J. Immunol., 
152:5398-5403; Suto etal, 1995, Science, 269:1585-1588). 
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Noncovalent complexes of HSPs and peptide, purified fi-om cancer cells, can be used 
for the treatment and prevention of cancer and have been described in PCX pubUcations WO 
96/10411, dated April 11, 1996, and WO 97/10001, dated March 20, 1997 (U.S. Patent No. 
5,750,119 issued April 12, 1998, and U.S. Patent No. 5,837,251 issued November 17, 1998, 

5 respectively, each of which is incorporated by reference herein in its entirety). The isolation 
and purification of stress protein-peptide complexes has been described, for example, fi"om 
pathogen-infected cells, and can be used for the treatment and prevention of infection caused 
by the pathogen, such as viruses, and other intracellular pathogens, including bacteria, 
protozoa, fimgi and parasites (see, for example, PCT PubUcation WO 95/24923, dated 

IQ September 21, 1995). Immunogenic stress protein-peptide complexes can also be prepared 
by in vitro complexing of stress protein and antigenic peptides, and the uses of such 
complexes for the treatment and prevention of cancer and infectious diseases has been 
described in PCT publication WO 97/10000, dated March 20, 1997 (U.S. Patent No. 
6,030,618 issued February 29, 2000. The use of stress protein-peptide complexes for 

J 5 sensitizing antigen presenting cells in vitro for use in adoptive inmaunotherapy is described 
in PCT publication WO 97/10002, dated March 20, 1997 (see also U.S. Patent No. 5,985,270 
issued November 16, 1999), 

2.3. ALPHA (2) MACROGLOBULDSf RECEPTOR 

20 The alpha (2) macroglobulin receptor (herem referred to interchangeably as either 

"a2MR" or **the a2M receptor*'), also known as LDL (low-density lipoprotein) receptor- 
Related Protein (*TRP") or CD91, is primarily expressed in liver, brain and placenta. The 
a2M receptor is a member of the low density lipoprotein receptor family. The extracellular 
domain of the human receptor comprises six 50-amitio acid EOF repeats and 31 complement 

25 repeats ofapproximately 40-42 amino acids. The complement repeats are organized, j&om 
the amino to the carboxy-tenninus, into clusters of 2, 8, 10 and 1 1 repeats, called Cluster I, 
n, m and IV (Herz et al, 1988, EMBO J. 7:41 19-4127). One study points to Cluster H (Cl- 
n), which contains complement repeats 3-10 (CR3-10), as the major ligand binding portion 
of the receptor (Horn et al, 1997, J. Biol. Chem. 272:13608-13613). The a2M receptor 

30 plays a role in endocytosis of a diversity of ligands. In addition to a2M, other ligands of 
a2MR include lipoprotein complexes, lactoferrin, tissue-type plasmmogen activator (tPA), 
urokinase-type plasminogen activator (uPA), and exotoxins. Thus, the o2M receptor plays 
roles in a variety of cellular processes, mcluding endocytosis, antigen presentation, 
cholesterol regulation, ApoE-containing lipoprotein clearance, and chylomicron remnant 

35 removal. 
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Human a2M is synthesized as a 1474 amino acid precursor, the first 23 of which 
function as a signal sequence that is cleaved to yield a 1451 amino acid mature protein (Kan 
et al., 1985, Proc. Natl. Acad. Sci. U.S .A. 82:2282-2286). In experiments with recombmant 
protein, the carboxy-terminal 138 amino acids of a2M (representing anndno acids 1314-1451 

5 of the mature protein) was found to bind the receptor. This domain has been called the RBD 
(receptor-binding domain; Salvesent et al, 1992, FEBS Lett. 313:198-202; Holtet et al, 
1994, FEBS Lett. 344:242-246). An RBD variant (RBDv), a proteolytic fragment of a2M 
comprising an additional 15 amino terminal residues (representing amino acids 1314-1451 of 
the mature protein) binds to the receptor with almost the same afSnity as a2M-proteinase 

10 (Holtet et al, 1994, FEBS Lett. 344:242-246). 

Alignment of a2MR ligands identifies a conserved domain present in ttxe RBDs of a 
macroglobulins. The conserved sequence spans amino acids 1366-1392 of human a2M. 
Conserved residues within this domain are Phejag^, Leu,3^, LyS]37o. Valjj^j, Lys,374, Glu,377, 
Val,382, Arg,384 (Nielsen et al, 1996, J. Biol. Chem. 271:12909-12912). Of these, Lysi370 and 

15 Lysi374 were shown to be critical for receptor binding (Nielsen et al, 1996, J. Biol. Chem. 
271:12909-12912). 

Binding of ligands, including the binding to a2M, to a2MR is inhibited by a2MR- 
associated protein (RAP). RAP is a 39 kDa folding chaperone that resides m the 
endoplasmic reticulum and is required for the normal processing of a2MR. RAP has the 

20 ability to competitively inhibit the binding of all a2MR to all a2MR ligands tested. One 
study shows RAP to bind to complement repeats C5-C7 in cluster n (Cl-II) of a2MR (Hom 
et al., 1997, J. Biol. Chem. 272:13608-13613); another shows RAP to bind to all two 
. complement repeat-modules in Cl-II except the C9-C10 module (Andersen et al, J. Biol, 
Chem., Mar. 24, 2000, PMID: 10747921; published electronically ahead of print). Three 

25 structural domains, 1, 2 and 3, have been identified in RAP, consisting of amino acid 
residues 18-1 12, 113-218 and 219-323, respectively. Ligand competition titration of 
recombinant RAP domains indicates that determinants for the inhibition of test ligands 
reside in the C-terminal regions of domains 1 and 3 (EUgaard et al, 1997, Eur. J. Biochem. 
244:544-51), 

30 

2.4. ANTIGEN PRESENTATION 

Major histocompatibility complex (MHC) molecules present antigens on the cell 
surface of antigen-presentmg cells. Cytotoxic T lymphocytes (CTLs) then recognize MHC 
molecules and their associated peptides and kill the target cell. Antigens are processed by 
35 two distinct antigen processing routes depending upon whether fheir origin is intracellular or 
extraceUular. Intracellular or endogenous protein antigens, Le., antigens synthesized within 
the antigen-presenting cell, are presented by MHC class I (MHC I) molecules to CD8+ 
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cytotoxic T lymphocytes. On the other hand, extracellular or exogenously synthesized 
antigenic determinants are presented on the cell surface of "specialized" or "professional" 
APCs (macrophages, for example) by MHC class n molecules to CD4+ T cells (see, 
generally, Fundamental Immunology, W.E. Paul (ed.), New York: Raven Press, 1984). This 

5 compartmental segregation of antigen processing routes is important to prevent tissue 

destruction that could otherwise occur dxiring an immune response as a result of shedding of 
neighboring cell MHC I antigens. 

The heat shock protein gp96 chaperones a wide array of peptides, depending upon the 
source from which gp96 is isolated (for review, see Srivastava et aL, 1998, Immunity 8: 657- 

IQ 665). Tumor-derived gp96 carries tumor-antigenic peptides (Ishii et aL, 1999, J. 
Immunology 162:1303-1309); gp96 preparations from virus-infected cells cany viral 
epitopes (Suto and Srivastava, 1995, Science 269:1585-1588; Nieland et aL, 1998, Proc. 
Natl. Acad. Sci. USA 95:1800-1805), and gp96 preparations from cells transfected with 
model antigens such as ovalbumin or p-galactosidase are associated with the corresponding 

15 epitopes (Arnold et aL, 1995, J. Exp. Med.l 82:885-889; Breloer et aL, 1998, Eur. J. 
Immunol. 28:1016-1021). The association of gp96 with peptides occurs in vivo (Menoret 
and Srivastava, 1999, Biochem. Biophys. Research Commun. 262:813-818). Gp96-pq)tide 
complexes, wheth^ isolated from cells (Tamura et aL, 1997, Science 278:117-120), or 
reconstituted in vitro (Blachere et aL, 1997, J. Exp. Med. 186:1183-1406) are excellent 

20 immunogens and have been used extensively to elicit 0)8+ T cell responses specific for the 
gp96-chaperoned antigenic peptides. 

The capacity of gp96-peptide complexes to elicit an immune response is dependent 
upon the transfer of the peptide to MHC class I molecules of antigen-presenting cells (Suto 
•and Srivastava, 1995, supra). Endogenously synthesized antigens chaperoned by gp96 in 

25 the endoplasmic reticulum [ER] can prime antigen-specific CD8+ T cells (or MHC I- 

restricted CTLs) in vivo; this priming of CD8+ T cells requires macrophages. However, the 
process whereby exogenously introduced gp96-peptide complexes elicit the antigen-specific 
CD8+ T cell response is not completely understood since there is no established pathway for 
the translocation of extracellular antigens into the class I presentation machinery. Yet 

30 antigenic peptides of extracellxilar origin associated with HSPs are somehow salvaged by 
macrophages, channeled into the endogenous pathway, and presented by MHC I molecules 
to be recognized by CD8+ lymphocytes (Suto and Srivastava, 1995, supra; Blachere et aL, 
1997, J. Exp. Med. 186:1315-22). 

SevOTal models have been proposed to e:q)lain the delivery of extracellular peptides 

35 for antigen presentation. One proposal, known as the "direct transfer" model, suggests that 
HSP-chaperoned peptides are transferred to MHC I molecules on the cell surface of 
macrophages for presentation to CD8+ T lymphocytes. Another suggestion is that soluble 
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extracellular proteins can be trafficked to the cytosol via constitutive macropinocytosis in 
bone marrow-derived macrophages and dendritic cells (Norbury et aL, 1997, Eur. J. 
Immunol. 27:280-288). Yet another proposed mechanism is that HSPs are taken up by the 
MHC class I molecules of the macrophage, which stimulate the appropriate T cells 

5 (Srivastava et al, 1994, Immunogenetics 39:93-98. Others have suggested that a novel 
intracellular trafficking pathway may be involved for the transport of peptides from the 
extracellular medium into the lumen of ER (Day et al, 1997, Proc. Natl. Acad. Sci. 94:8064- 
8069; Nicchitta, 1998, Curr. Opin. in Immunol. 10:103-109). Further suggestions include 
the involvement of phagocytes which (a) possess an ill-defined pathway to shunt protein 

J 0 from the phagosome into the cytosol where it would enter the normal class I pathway; (b) 
digest ingested material in lysosomes and regurgitate peptides for loading on the surface to 
class I molecules (Bevan, 1995, J. Exp. Med. 182:639-41). 

Still others have proposed a receptor-mediated pathway for the delivery of 
extracellular peptides to the cell surface of APCs for antigen presentation. In view of the 

15 extremely small quantity of gp96-chaperoned antigenic peptides required for immunization 
(Blachere et aL, 1997, supra), and the strict dependence of itnmunogenicity of gp96-peptide 
complexes on functional antigen presenting cells (APCs) (Udono et al, 1994, Proc. Nati. 
Acad. Sci. U.S.A. 91:3077-3081), APCs had been proposed to possess receptors for gp96 
(Srivastava et aL, 1994, Immunogenetics 39:93-98). Preliminary microscopic evidence 

20 consistent with such receptors has been recently obtained (Binder et al, 1998, Cell Stress & 
Chaperones 3 (Supp.l):2.; Amold-Schild etal, 1999, J. Immunol. 162: 3757-3760; and 
Wassenberg et al, 1999, J. Cell Sci. 1:12). One hypothesis is that the mannose receptor is 
used in the uptake of gp96, but no mechanism has been proposed for the non-glycosylated 
HSPs, such as Hsp70 (Ciupitu et al, 1998, J. Exp. Med., 187:685-691). 

25 The identification and characterization of specific molecules involved in HSP- 

mediated antigen presentation of peptides could provide useful reagents and techniques for 
eHciting specific immunity by HSP and HSP-pq)tide complexes, and for developing novel 
diagnostic and therapeutic methods. 

Citation or discussion of a reference herein shall not be construed as an admission 

3Q that such is prior art to the present invention. 

3. SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods for the use of the alpha (2) 
macroglobulin ("a2M") receptor as a heat shock protein receptor. The invention is based, in 
35 part, on the Applicant's discovery that the a2M receptor is a cell surface receptor for heat 
shock proteins. In particular, the AppUcant has shown that the heat shock protein gp96, 
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hsp90, hsp70, and calreticulin binds directly to the a2M receptor, and that a2M inhibits re- 
presentation of gp96, hsp90, hsp70, and caheticxilin-chaperoned antigenic peptides by 
macrophages. Because no precedent exists for receptors that recognize abundant and 
intracellular proteins like HSPs, the discovery of an HSP cell surface receptor was highly 
5 unexpected. 

The present invention provides compositions comprising complexes of HSPs and the 
a2M receptor, and antibodies and other molecules that biad the HSP-a2M receptor complex. 
The invention also encompasses methods for the use of the a2M receptor as a heat shock 
protein receptor, including methods for screening for compounds that modulate the 

1 0 interaction of HSP and the a2M receptor, and methods for treatment and detection of HSP- 
a2M receptor-mediated processes and HSP-a2M receptor-related disorders and conditions, 
such as autoimmune disorders, proliferative disorders and infectious diseases. 

The invention provides a method for identifying a compound that modulates an HSP- 
a2M receptor-mediated process, comprising: (a) contacting a test compound with a heat 

1 5 shock protein and an alpha (2) macroglobulin receptor; and (b) measuring the level of alpha 
(2) macroglobulin receptor activity or expression, such that if the level of activity or 
expression measured in (b) differs from the level of alpha (2) macroglobulin receptor activity 
in the absence of the test compound, then a compoimd that modulates an HSP-a2M 
receptor-mediated process is identified. In one embodiment of this method the compoxmd 

20 identified is an antagonist which interferes with the interaction of the heat shock protein with 
the alpha (2) macroglobulin receptor, further comprising the step of: (c) determining 
whether the level interferes with the interaction of the heat shock protein and the alpha(2) 
macroglobulin receptor. In another embodiment, the test compound is an antibody specific 
for the alpha (2) macroglobulin receptor. In another embodiment, the test compound is an 

25 antibody specific for alpha (2) macroglobulin. In another embodiment, test compound is an 
antibody specific for a heat shock protein. In another embodiment, the test compoimd is a 
small molecule. In another yet embodiment, the test compound is a peptide. In another 
embodiment, the peptide comprises at least 5 consecutive amino acids of the alpha (2) 
macroglobulin receptor. In yet another embodiment, the peptide comprises at least 5 

3Q consecutive amino acids of alpha (2) macroglobulin. In yet another embodiment, the peptide 
comprises at least 5 consecutive amino acids of a heat shock protein sequence. In another 
embodiment, the compoimd is an agonist which enhances the interaction of the heat shock 
protein with the alpha (2) macroglobulin receptor. In another embodiment, which the HSP- 
a2M receptor-mediated process affects an autoimmune disorder, a disease or disorder 

35 involving disruption of antigen presentation or endocytosis, a disease or disorder involving 
cytokine clearance or inflammation, a piolifemtive disorder, a viral disorder or other 
infectious disease, hypercholesterolemia, Alzheimer's disease, diabetes, or osteoporosis. 
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The invention also provides a method for identifying a compound that modulates an 
HSP-a2M receptor-mediated process, comprising: (a) contacting a test compound with a 
heat shock protein and an alpha (2) macroglobiilin receptor-expressing cell; and (b) 
measuring the level of alpha (2) macroglobulin receptor activity or expression in the cell, 

5 such that if the level of activity or expression measured in (b) differs from the level of alpha 
(2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified. In yet another 
embodiment, wherein the alpha (2) macroglobulin receptor activity measured is the ability to 
interact with a heat shock protein. 

JO The invention also encompasses a method for identifying a compound that modulates 

the binding of a heat shock protein to the a2M receptor, comprising: (a) contacting a heat 
shock protein with an alpha (2) macroglobiilin receptor, or fragment, or analog, derivative or 
mimetic thereof, in the presence of a test compound; and (b) measuring the amotmt of heat 
shock protein bound to the alpha (2) macroglobulin receptor, or fragment, analog, derivative 

J 5 or mimetic thereolt such that if the amount of bound heat shock protein measured in (b) 
differs from the amount of bound heat shock protein measured in the absence of the test 
compound, then a compound that modulates the binding of an HSP to the a2M receptor is 
identified. In mother embodiment, alpha (2) macroglobulin receptor contacted in step (a) is 
on a cell surface. In another embodiment, the alpha (2) macroglobulin receptor is 

20 immobilized to a solid surface. In another embodiment, the solid surface is a microtiter dish. 
In another embodiment, the amount of bound heat shock protein is measured by contacting 
the cell with a heat shock protein-specific antibody. In yet another embodiment, the heat 
shock protein is labeled and the amount of bound heat shock protein is measured by 
detecting the label. In another embodiment, tiie heat shock protein is labeled with a 

25 fluorescent label. 

The invention further provides a method for identifying a compound that modulates 
heat shock protein-mediated antigen presentation by alpha (2) macroglobulin receptor- 
expressing cells comprising: (a) adding a test compound to a mixture of alpha (2) 
macroglobulin receptor-expressing cells and a complex consisting essentially of a heat shock 

30 protein noncovalently associated with an antigenic molecule, under conditions conducive to 
alpha (2) macroglobulin receptor-mediated endocytosis; (b) measuring the level of antigen- 
specific stunulation of cytotoxic T cells by alpha (2) macroglobulin receptor-expressing 
cells, such that if the level measured in (b) differs &om the level of said stimulation in the 
absence of the test compound, then a compound that modulates heat shock protein-mediated 

35 antigen presentation by alpha (2) macroglobulin receptor-expressing cells is identified. In 
one embodiment of this method, the step of measuring the level of the antigenic molecule 
presented on the cell surface of step (b) comprises: (i) adding the alpha (2) macroglobulin 
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recq>tor-expressmg cells fonned in step (a) to T cells under conditions conducive to the 
activation of the T cells; and (u) comparing the level of activation of said cytotoxic T cells 
with the level of activation of T cells by an alpha (2) macroglobulin receptor-expressing cell 
formed in the absence of the test compoimd, wherein an increase of decrease in level of T 

5 cell activation indicates that a compound that modulates heat shock protein-mediated antigen 
presentation by alpha (2) macroglobulin receptor-expressing cells is identified. 

In various embodiments, the heat shock protein used in the methods of the invention 
is gp96. Alternatively, the heat shock proteins hsp90, hsp70, or caketiculin may be used in 
various embodiments of the invention. 

10 In another embodiment, the invention provides a method for detecting a heat shock 

protein-alpha (2) macroglobulin receptor-related disorder in a mammal comprising 
measuring the level of an HSP-alpha (2) macroglobulin receptor-mediated process in a 
patient sample, such that if the measured level differs from the level found in clinically 
normal individuals, then a heat shock protein-alpha (2) macroglobulm receptor-related 

15 disorder is detected. 

The invention also encompasses kits comprising compositions of the invention. In 
one embodiment, a kit is provided, packaged in one or more containers, comprising: (a) a 
purified heat shock protein, nucleic acid encoding a heat shock protein, or cell expressing a 
heat shock protein; and (b) an alpha (2) macroglobulin receptor polypeptide, nucleic acid 

20 encoding an alpha (2) macroglobulin receptor polypeptide, or cell expressing an alpha (2) 
macroglobulin receptor polypeptide. In one embodiment, the kit the alpha (2) macroglobulin 
receptor polypeptide, nucleic acid encoding an alpha (2) macroglobulin receptor polypeptide, 
or cell expressing an alpha (2) macroglobulin receptor polypeptide is purified. In another 
embodiment, the kit further comprises instmctions for use in treating an autoimmune 

25 disorder, an infectious disease, or a proliferative disorder. 

The invention also provides a method for modulating an immune response 
comprising administering to a mammal a purified compound that modulates the interaction 
of a heat shock protein with the alpha (2) macroglobulin receptor. In one embodiment, the 
compound is an agonist which enhances the interaction of the heat shock protein and the 

30 alpha (2) macroglobulin receptor. In another embodiment of this method the compoimd in 
an antagonist that interferes with the interaction between the heat shock protein and the a2M 
receptor. 

The invention fiirther provides a method for treating an autoimmune disorder 
comprismg administering to a mammal in need of such treatment a purified compound that 
35 interferes with the interaction of a heat shock protein with the alpha (2) macroglobulin 
receptor. In one embodiment of this method the compound in an antagonist that interferes 
wifli the interaction between the heat shock protem and the a2M receptor. In one 
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embodiment, the antagonist is an antibody specific for alpha (2) macroglobulin receptor. In 
another embodiment, the antagonist is an antibody specific for a heat shock protein. In 
another embodiment, the antagonist is a small molecule. In another embodiment, the 
antagonist is a peptide. In another embodiment, the peptide comprises at least 5 consecutive 
5 amino acids of alpha (2) macroglobulin receptor. In another embodiment, the peptide 
comprises at least 5 consecutive amino acids of alpha (2) macroglobulin. In another 
embodiment, the peptide comprises at least 5 consecutive amino acids of a heat shock protein 
sequence. 

The invention further provides a method for increasing the iimnunopotency of a 
IQ cancer cell or an infected cell comprising transforming said cell with a nucleic acid 

comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes 
an alpha (2) macroglobulin receptor polypeptide. 

Still further, the invention provides a method for increasing the immunopotency of a 
cancer cell or an mfected cell comprising: (a) transforming said cell with a nucleic acid 
1 5 comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes 
an alpha (2) macroglobulin receptor polypeptide, and (b) administering said cell to an 
individual in need of treatment, so as to obtain an elevated immune response. 

The invention also provides a recombinant cancer cell transfonned with a nucleic 
acid comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) 
20 encodes an alpha (2) macroglobulin receptor polypeptide. In one embodiment, the 
recombinant cell is a human cell. 

In yet another embodiment, the invention provides a recombinant infected cell 
transformed with a nucleic acid comprising a nucleotide sequence that (i) is operably linked 
to a promoter, and (ii) encodes an alpha (2) macroglobulin receptor polypeptide. In one 
25 embodiment, the recombinant cell is a hiunan cell. 

In another embodiment, the invention provides a method for screening for molecules 
that specifically bind to an a2M receptor comprising the steps of: (a) contacting an a2M 
receptor with one or more test molecules under conditions conducive to binding; and 
(b) determining whether any of said test molecules specifically bind to the a2M receptor. 
30 In one embodiment of this method, test molecules are potential immunotherapeutic drugs. 

The invention also provides a method for identifying a compound that modulates the 
binding of an a2M receptor ligand to the a2M receptor comprising: contacting an a2M 
receptor with an a2M receptor ligand, or an a2M receptor-binding firagment, analog, 
derivative, or mimetic thereof, in the presence of one or more test compound; and (b) 
35 measuring the. amount of a2M receptor ligand, or fragment, analog, derivative or mimetic 
thereof, bound to the a2M receptor, such that if the amount of bound a2M receptor ligand 
measured in (b) differs from the amount of bound a2M measured in the absence of the test 
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compound, then a compound that modulates Qie binding of an a2M receptor ligand to the 
a2M receptor is identified. 

In another embodiment, a method is provided for identifying a compound that 
modulates the interaction between the a2M receptor and an a2M receptor ligand, comprising: 
5 (a) contacting an a2M receptor with one or more test compounds; and (b) measuring the 
level of a2M receptor activity or expression, such that if the level of activity or expression 
measured in (b) differs from the level of a2M receptor activity in the absence of one or more 
test compounds, then a compound that modulates the interaction between the a2M receptor 
and an a2M receptor Ugand is identified. In one embodiment, the a2M receptor hgand is 



In another embodiment, a method is provided for identifying a compound that 
modulates antigen presentation by a2M receptor-expressing cells comprising: (a) adding one 
or more test compounds to a mixture of a2M receptor-expressing cells and a complex 
comprising an a2M receptor ligand and an antigenic molecule, under conditions conducive to 

15 a2M receptor-mediated endocytosis; (b) measuring the level of stimulation of antigen- 
specific cytotoxic T cells by the a2M receptor-expressing cells, such fliat if the level 
measured in (b) differs &om the level of said stimulation in the absence of the one or more 
test compounds, then a compound that modulates antigen presentation by a2M receptor- 
^ressmg cells is identified. 

20 Iq another embodiment, the invention provides a method for modulating an immune 

response comprising administering to a mammal a purified compound that binds to the a2M 
receptor in an amount effective to modulate an inmnme response in the mammal. 

In yet another embodiment, a method for treating or preventing a disease or disorder 
is provided comprising administering to a manomal a purified compound that binds to the " 

25 o2M receptor in an amount effective to treat or prevent a disease or disorder in the manMnal. 
In one embodiment, the disease or disorder is cancer or an infectious disease. 

In a further embodiment, a method is provided for treating an autoimmune disorder 
comprising administering to a mammal in need of such treatment a purified compound that 
binds to the a2M receptor in an amount effective to treat an autoimmune disorder in the 

30 mammal. 

In another aspect of the invention, a method is provided for stimulating an immune 
response in a patient comprising administering to said patient blood which has been 
withdrawn from said patient and treated to remove an a2M receptor ligand. In a specific 
embodiment, the method further comprises administering to said patient a heat shock protein 
35 or a heat shock protein-antigenic peptide complex. In a ^ecific embodiment, blood is 
administered to said patient by syringe. In another embodiment, said blood is administered 
to said patient by an intravenous drip. 
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In another embodiment, a method is provided for stimulating an immune response in 
a patient comprising: a) removing a a2M receptor Ugand from blood withdrawn from said 
patient; and b) returning at least a portion of the a2M receptor ligand-depleted blood to said 
patient 

5 In another embodiment, a method is provided for stimulating an immime response in 

a patient comprising: a) withdrawing blood from said patient; b) removing a a2M receptor 
ligand from said blood; and c) returning at least a portion of the a2M receptor hgand- 
depleted blood to said patient. In a specific embodiment, the method ftirther comprises after 
step (a) and before step (c) the step of adding a heat shock protein or a heat shock protein 

10 antigenic-peptide complex to said blood. In a specific embodiment, said blood is returned to 
said patient by syringe. In another specific embodiment, said blood is retumed to said 
patient by an intravenous drip. In another specific embodiment, the removing a a2M 
receptor Ugand fix)m the blood comprises the step of contacting the blood with a solid phase 
attached to a a2M receptor ligand-binding molecule for a time period and under conditions 

15 suflScient to allow binding of a2M receptor ligand to the a2M receptor ligand-binding 
molecule solid phase. In another specific embodiment, the a2M receptor ligand-binding 
molecule is a2M receptor, or a firagment thereof In another embodiment, said a2M receptor 
ligand-binding molecule does not bind a heat shock protein. In another embodiment, the 
a2M receptor ligand-binding molecule is an a2M receptor ligand-specific antibody, or a 

20 fragment thereof. 

In various embodiments, an apheresis system is used in said removing stq). In other 
embodiments blood is withdrawn manually in said withdrawing step. In various 
embodiments, said removing step comprises separating the plasma from said blood and 
treating said plasma to remove said a2M receptor ligand. 

25 The invention ftirther provides a kit comprising in one or more containers a solid 

phase chromatography column with a purified a2M receptor ligand binding molecule 
attached thereto, such that withdrawn blood can be run over the column to deplete the blood 
of a a2M receptor ligand. In one embodiment, the a2M receptor Ugand binding molecule of 
the kit does not bind heat shock proteins. 

30 In various embodiments, the a2M receptor Ugand is a2M, a lipoprotein complex, 

lactoferrin, tissue-type plasminogen activator, urokinase-type plasminogen activator, or an 
exotoxin. 

The term '*HSP-et2M receptor-mediated process" as used herein refers to a process 
dependent and/or responsive, either directly or indirectly, to the interaction of HSP with the 
35 a2M receptor. Such processes include processes that result fix>m an aberrant level of 

expression, synthesis and/or activity of a2M receptor, such as endocytic activities relating to 
the binding of the various a2M Ugands, including but not limited to HSP, a2M, Upoprotein 
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complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type plasminogen 
activator (uP A), and exotoxins. Such processes include, but are not limited to, endocytosis, 
antigen presentation, cholesterol regulation, apoE-containing lipoprotein clearance, and 
chylomicron remnant removal. 

5 The terms "HSP-a2M receptor-related disorder" and "HSP-a2M receptor-related 

condition", as used herein, refers to a disorder and a condition, respectively, involving a 
HSP-a2M receptor interaction. Such disorders and conditions may result, for example, jfrom 
an aberrant ability of the a2M receptor to interact with HSP, perhaps due to aberrant levels of 
HSP and/or a2M receptor expression, synthesis and/or activity relative to levels found in 

10 normal, unaffected, unimpaired individuals, levels found m clinically normal individuals, 
and/or levels found in a population whose levels represent a baseline, average HSP and/or 
a2M receptor levels. Such disorders include, but are not limited to, autoimmune disorders, 
diseases and disorders involving disruption of antigen presentation and/or endoqrtosis, 
diseases and disorders involving cytokine clearance and/or inflammation, proliferative 

15 disorders, viral disorders and other infectious diseases, hypercholesterolemia, Alzheimer's 
disease, diabetes, and osteoporosis. 

The term "a2MR ligand" as used herein, refers to a molecule capable of binding to 
the a2M receptor. Such a2MR ligands include as well as known ligands, such as, but not 
limited to, a2M and a2M complexes, heat shock proteins and heat shock protein complexes, 

20 Kpoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type 
plasminogen activator (uPA), and exotoxins. In addition, a2MR ligands also include 
molecules which can readily be identified as a2MR ligands using standard binding assays 
well known in the art. Such a2MR ligands are typically endocytosed by cell upon binding to 
the a2M receptor. 

25 

4. BMEFDESOOTTIONOFTHEHGURES 

PIG. lA-C, Identification of an 80 kDa polypeptide as a putative gp96 receptor. A. 
Confocal microscopy of re-presentation-competent RAW264.7 cells stained with gp96-FITC 
(left panel) and with albumm-FrrC (right panel). B. SDS-P AGE analysis of detergent 
extracts of plasma membranes fix)m surface biotmylated RAW264.7 (re-presentation- 
competent) or P815 cells (representation-incompetent) eluted firom gp96 or albumin- 
Sepharose (S A) columns and stained with silver stain (top) or avidin-peroxidase (bottom). 
C gp96-S ASD-I*^^ was cross-hnked to live peritoneal macrophages (MO) or P8 1 5 cells, 
and the cell lysates examined by SDS-P AGE and autoradiography. Various components were 
omitted as controls, as indicated. 
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FIG* 2A-B. Anti-p80 antiserum detects an 80 kDa molecule and inhibits re-presentation of 
gp96-chaperoned AHl peptide by macrophage. A. Pre-inmiime and immune sera were used 
to probe blots of plasma membrane extracts of RAW264.7, peritoneal macrophages (both 
cell types re-presentation-competent), or P815 cells. B. Re-presentation of gp96-chaperoned 
5 peptide AHl . Sera were added at the jBnal dilution indicated. The solid cross indicates the 
level of T cell stimulation when the APCs were pulsed directly with the AHl peptide. The 
open cross indicates the corresponding value with unpulsed APCs. 

FIG. 3A-C, Protein microsequencing of the 80 kDa protein. A. Analysis of a single tryptic 
10 (GALHIYHQR) peptide by tandem- mass spectrometry. All possible b- and y-ion series 
together with identified b-ion series (red) and y-ion series (blue) are shown. B. Collision- 
induced dissociation (CID) spectrum of this peptide is shown. C. Four identified peptides 
from the a2M receptor, peptide mass, and sequence are shown. 

15 BIG. 4. a2-Macroglobulin inhibits re-presentation of gp96-chapeioned AHl peptide by 
macrophage. The solid cross indicates the level of T cell stimulation when the APCs were 
pulsed directly with the AHl peptide. The open cross indicates the corresponding value with 
unpulsed APCs. 

20 FIG. 5. Table of specific binding of HSPs and a2-macroglobulin to primary cultures and 
cell lines of several histological origins. The indicates percentage of cells staining with 
FITC over background staining alone. The indicates that the cells were examined by 
confocal microscopy. All CDl Ic"^ cells were intensely positive for binding to the three HSPs 
and a2M.. 

25 

JIG. 6A-B. Analysis of cells by flow cytometry for the presence of FITC labelled cells. 
The macrophage cell lines RAW264.7 (A) or RAW309Cr.l (B) were incubated with 
lOOmg/ml of FTTC labeled gp96, hsp90, hsp70 or SA. Live cells only were gated based on 
FSC. 

30 

FIG. 7A-B. Re-presentation of gp96-chaperoned peptides by APCs that bind HSPs and a2 
macroglobulin. The presence of IFN-y (pg/ml) was assayed as a marker for CTL stimulation. 
(A) Peritoneal macrophage or BM-DCs from C57B1/6 mice (1X104). (B) RAW 264.7 or 
RAW 309Cr.l macrophage lines were cultured with gp96 (40 mg/ml) by itself or complexed 
35 to the AHl-19 peptide and used to stimulate AHl specific CTLs (1X104). 
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FIG. 8. Peptides chaperoned by hsp90, CRT, hsp70 and gp96 but not serum albumin are 
re-presented by RAW264.7 cells. The chaperones, uncomplexed or complexed to the 
AHl-19 peptide were used to pulse RAW264.7 cells which were tested for their ability to 
stimulate cognate CTLs. 

5 

FIG. 9A-C. Gp96, hsp90, hsp70 and caketiculin utilize a common receptor for 
re-presentation. (A) RAW264.7 cells were pulsed with gp96-AHl-19 complexes (40 mg/ml 
gp96) in presence of increasing concentrations of uncomplexed gp96, hsp90, hsp70 or SA. 
(B) Re-presentation of AHl-19 complexed to gp96, hsp90, hsp70, CRT or albumin was 
10 carried out in presence of increasing concentrations of a2-macroglobulin. The data is plotted 
as percentage inhibition of re-presentation. (C) Re-presentation of AHl-19 complexed to 
gp96, hsp90, hsp70 or caketiculin in presence of increasing concentrations of anti-CD91 
antibody. The data is plotted as percentage inhibition of re-presentation. 

15 FIG. lOA-C. Re-presentation of gp96-chaperoned pq)tides follows the classical endogenous 
antigen presentation pathway. (A) Requirement of proteasomes. Peritoneal macrophage 
(1X106 ) were either treated or untreated with lactacystin (100 mM). The cells were labeled 
with chromium and used as targets against VSV8 specific CTLs. (B) Requirement of TAP as 
measured in vitro. Peritoneal macrophage from TAP+/+ or TAP-/- mice were cultured with 

20 gp96 or gp96-VSV19 complex and VSV8 specific CTL line. Culture supematants were 
tested for the presence of IFN-y (pg/ml) as a marker for CTL stimulation. (C) Requirement 
of TAP as measured in vivo. Gp96-VSV19 complex was injected intraperitoneally. After 10 
days, spleens were removed and cells were cultured in vitro with VSV8. The lymphocyte 
cultures were tested for their ability to lyse EL4 cells (dotted line) or EL4 cells pulsed with 

25 VS V8 peptide (solid line). Each line re-presents one mouse. 

FIG. 11. a2M recq)tor is a sensor of necrotic cell death due to its ability to detect 
extracellular gp96. Conversely, receptors ^sR) for phosphatidyl serine (ps) detect apoptotic 
cell death. 

30 

FIG, 12A. The mouse a2MR cDNA (SEQ ED N0:1) and predicted open reading frame of 
murine a2MR protem (Genbank accession no. CAA47817). The murine a2M protein 
(SEQ ID N0:2), with residues identified by microsequencmg an 80 kDa, gp96-interacting 
fragment of the receptor highlighted in bold. 

35 

FIG. 13A. The human a2M cDNA (SEQ ID N0:3) and predicted open reading frame of 
a2M protein (SEQ ID NO:4)(Genbank accession no. Ml 1313). B. The sequence of the 
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mature human a2M protein (SEQ ID N0:5), following cleavage of the N-terminal 23 amino 
acid signal sequence. Highlighted residues represent the 138 amino acid a2MR-binding 
domain (RBD). Underlined residues represent an extension of the RBD that is present in a 
a2MR-binding, proteolytic fragment of a2M (RBDv). Bolded residues have been shown to 
5 be important for a2MR binding. Italicized residues represent a domain that is conserved 
among ligands of a2MR. 

FIG. 14A. The human a2MR cDNA (SEQ ID N0:6) and predicted open reading frame of 
human a2MR protein (Genbank accession no. NP_002323). B. Primary amino acid sequence 

10 of human a2MR (SEQ ID N0:7). The approximate locations of complement repeat clusters 
I and n are highlighted in grey. Individual complement repeats of Cl-II are indicated as 
follows: amino acids of CR3, 5, 7 and 9 are in italics, and amino acids of CR4, 6, 8, and 10 
are underlined. Amino acids highlighted in bold were present in an 80kDa peptide fragment 
of flie mouse a2MR that bound to gp96. The double underlined residues represent the 

15 predicted signal peptide. For the locations ofother features ofthe receptor, such as the EGF 
repeats, see the article by (Herz et al, 1988, EMBO J. 7:41 19-4127). 



5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to compositions and methods for the use ofthe alpha (2) 
macroglobulin receptor (also referred to interchangeably herein as "a2MR" or "the a2M 
receptor") as a heat shock protein ("HSP") receptor. In particular, the present invention 
provides compositions comprising isolated a2MR- Ugand complexes, e.g., a2MR-HSP 
complexes, including isolated and/or recombinant cells, and antibodies, molecules and 
compounds that modulate the interaction of a2MR with an a2MR ligand, such as HSP. The 
invention further encompasses methods for the use of a2MR as a heat shock protein recqptor, 
including screening assays to identify compounds that modulate ttie interaction of a2MR 
with an HSP, or other a2MR ligand, and methods for the use of these molecules and 
complexes for the diagnosis and treatment of immune disorders, proliferative disorders, and 
infectious diseases. 

The term "a2MR ligand" as used herein, refers to a moleciile capable of bmding to 
the a2M receptor. Such a2MR ligands include as well as known ligands, such as, but not 
limited to, a2M and a2M complexes, heat shock proteins and heat shock protein complexes, 
lipoprotein complexes, lactofeirin, tissue-type plasmmogen activator (tPA), urokinase-type 
plasminogen activator (uPA), and exotoxins. In addition, a2MR ligands also include 
molecules which can readily be identified as a2MR ligands using standard binding assays 



-16- 



wo 01/92474 



PCTAJSOl/18041 



well known in the art. Such a2MR ligands are typically endocytosed by cell upon binding to 
a2MR. 

An HSP useful in the practice of the invention may be selected from among any 
cellular protein that satisfies any one of the following criteria: the intracellular concentration 

5 of an HSP increases when a cell is exposed to a stressful stimulus; an HSP can bind other 
proteins or peptides, and can release the bound proteins or peptides in the presence of 
adenosine triphosphate (ATP) or low pH; or an HSP possesses at least 35% homology with 
any cellular protein having any of the above properties. Preferably, the HSP used in the 
compositions and methods of the present invention includes, but are not limited to, HSP90, 

10 gp96, BiP, Hsp70, DnaK, Hsc70, PhoE calreticulin, PDI, or an sHsp, alone or in 
combination. 

In a preferred embodiment, an HSP is a mammalian (eg., mouse, rat, primate, 
domestic animal such as dog, cat, cow, horse), and is most preferably, human. 

Hsps useful in the practice of the invention include, but are not limited to, manbers 
15 of the HSP60 family, HSP70 family, HSP90 family, HSPlOO family, sHSP family, 

cahreticulin, PDI, and other proteins in the endoplasmic reticulum that contain thioredoxin- 
like domain(s), such as, but not limited to, ERp72 and ERp61 . 

HSP analogs, muteins, derivatives, and fragments can also be used in place of HSPs 
accordmg to the invention. An HSP peptide-binding "fragment" for use in the invention 
20 refers to a polypeptide comprising a HSP peptide-binding domain that is capable of 
becoming non-covalently associated with a peptide to form a complex that is capable of 
eliciting an immune response. In one embodiment, an HSP peptide-binding fragment is a 
polypeptide comprising an HSP peptide-binding domain of approximately 100 to 200 amino 
acids. 

25 Databases can also be searched to identify sequences with various degrees of 

similarities to a query sequence using programs, such as FASTA and BLAST, which rank the 
similar sequences by alignment scores and statistics. Such nucleotide sequences of non- 
limiting examples of HSPs that can be used for preparation of the HSPs used in the methods 
of the invention are as follows: human Hsp70, Genbank Accession No. NM__005345, 

30 Sargent ei al, 1989, Proc. Natl. Acad. Sci. U.S.A., 86:1968-1972; human Hsp90, Genbank 
Accession No. X15183, Yamazaki et al, Nucl. Acids Res. 17:7108; human gp96: Genbank 
Accession No. X15187, Maki et al, 1990, Proc. Natl. Acad Sci., 87: 5658-5562; human BiP: 
Genbank Accession No. M19645; Ting et al, 1988, DNA 7: 275-286; human Hsp27, 
Genbank Accession No. M24743; Hickey et al, 1986, Nucleic Acids Res. 14:4127-45; 

35 mouse Hsp70: Genbank Accession No. M35021, Hunt et al, 1990, Gene, 87:199-204; 
mouse gp96: Genbank Accession No. M16370, Srivastava et al, 1987, Proc. Natl. Acad. 
Sci., 85:3807-3811; and mouse BiP: Genbank Accession No. U16277, Haas et aLy 1988, 
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Proc. Natl. Acad. Sci. U.S. A., 85: 2250-2254. Due to the degeneracy of the gaietic code, the 
term "HSP sequence", as used herein, refers not only to the naturally occurring amino acid 
and nucleotide sequence but also encompasses all liie other degenerate sequences that encode 
the HSP. 

5 The aforementioned HSP families also contain proteins that are related to HSPs in 

sequence, for example, having greater than 35% amino acid identity, but whose expression 
levels are not altered by stress. Therefore, it is contemplated that the definition of heat shock 
or stress protein, as used herein, embraces other proteins, mutants, analogs, and variants 
thereof having at least 35% to 55%, preferably 55% to 75%, and most preferably 75% to 

1 0 85% amino acid identity with members of these families whose expression levels in a cell are 
enhanced in response to a stressful stimulus. The determination of percent identity between 
two sequences can also be accomplished using a mathematical algori&m. A preferred, 
non-limiting example of a malhematical algorithm utilized for the comparison of two 
sequences is the algorithm of Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 

15 87:2264-2268, modified as in Karlin and Altschul, 1993, Proc. Natl. Acad. Sci. USA 

90:5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST programs 
of Altschul, et al., 1990, J. Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide 
sequences homologous to a nucleic acid molecules of the invention. BLAST protein 

20 searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain 
amino acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes. Gapped BLAST can be utilized as described in 
Altschul et al, 1997, Nucleic Acids Res.25:33 89-3402. Alternatively, PSI-Blast can be used 
to perform an iterated search which detects distant relationships between molecules (Altschul 

25 et al, 1997, supra). When utilizing BLAST, Gapped BLAST, and PSLBlast programs, the 
default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used 
(see http://www.ncbi.nlm.nih.gov). Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers 
and Miller, 1988, CABIOS 4:1 1-17. Such an algorithm is incorporated into the ALIGN 

3Q program (version 2.0) which is part of the GCG sequence alignment software package. 

When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

The immunogenic HSP-peptide complexes of the invention may include any complex 
containing an HSP and a peptide that is capable of inducing an immune response in a 

35 mammal. The peptides are preferably noncovalently associated with the HSP. Preferred 
complexes may include, but are not limited to, gp96-peptide complexes, HSP90-peptide 
complexes, HSP70-peptide complexes, HSP60-peptide complexes, HSPlOO-peptide 
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complexes, calreticulin-peptide complexes, and sHSP-peptide complexes. For example, the 
HSP gp96 which is present in the endoplasmic reticulum of eukaiyotic cells and is related to 
the cytoplasmic HSP90's can be used to generate an effective vaccine containing a gp96- 
peptide complex. 

5 The HSPs, a2MR, and/or antigenic molecules for use m the invention can be purified 

firom natural sources, chemically synthesized, or recombmantly produced. Although the 
HSPs may be allogeneic to the patient, in a preferred embodiment, the HSPs are autologous 
to the patient to whom they are administered. 

10 5.1 COMPOSITIONS OF THE INVENTION 

The present invention provides compositions that modxilate the interaction between 
a2MR and an a2MR ligand, such as, for example, an HSP. Such compositions can be used 
in methods to elicit or modulate an immxme response. Such compositions also include 
antibodies that specifically recognize HSP- a2MR complexes, isolated cells that express 

15 HSP-a2MR complexes, and isolated and recombinant cells that contain recombinant a2MR 
and HSP sequences. In addition, in various methods of the invention, sequences encoding 
a2MR, an HSP, and a2M are used for immunotherapy. Such compositions can be used, for 
example, in immimoflierapy against proliferative disorders, infectious diseases, and other 
HSP-a2MR-related disorders. Methods for the synthesis and production of such 

20 compositions are described herein. 

5.1.1 RECOMBINANT EXPRESSION 

In various embodiments of the invention, sequences encoding the a2MR, an HSP, 
a2M, or other a2AdR ligand are inserted into an expression vector for propagation and 
25 expression in recombinant cells. Thus, in one embodiment, the a2M receptor, HSP, a2M, or 
other a2MR ligand coding region is linked to a non-native promoter for expression in 
recombinant cells. 

The amino acid sequence of the portion of a2MR that recognizes and binds to HSPs 
is shown in FIG. 12B (SEQ ID N0:2). Based on the discovery by the Applicant, this 

30 portion of a2MR is responsible for recognizing and binding to HSPs and HSP-antigenic 
peptide complexes. After binding HSPs, a2MR facilitates transport of the HSP-antigenic 
peptide complex into the cell, where the peptide antigens associate with MHC class I 
molecules and are then presented on the cell surface of the cell, and become available to 
stimulate an immune response. Based on this invention, compositions comprising agonists 

35 and antagonists of a2MR and HSPs interactions can be used to modulate the immune 

response. Thus, recombinant a2MR polypeptides, complexes of a2MR and an HSP or HSP- 
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antigenic peptide complexes, and recombinant cells expressing a2MR or complexes 
comprising a2MR and antigenic peptides can be used in methods for immunotherapy and 
diagnostic methods described herein. 

In various «nbodiments of the invention, sequences encoding the a2MR, and/or a 

5 heat shock protein or a2M, or fragments thereof, are inserted into an expression vector for 
propagation and expression in recombinant cells. An expression construct, as used herein, 
refers to a nucleotide sequence encoding a particular gene product, such as the a2MR, HSP 
or a2M, operably associated with one or more regulatory regions which allows expression of 
the encoded gene product in an appropriate host cell. "Operably-associated" refers to an 

jQ association in which the regulatory regions and the nucleotide sequence encoding the gene 
product to be expressed are joined and positioned in such a way as to permit transcription, 
and ultimately, translation. 

The DNA may be obtained from known sequences derived from sequence databases 
by standard procedures known in the art by DNA amplification or molecular cloning directly 

1 5 from a tissue, cell culture, or cloned DNA (e.g. , a DNA "library"). Any eukaryotic cell may 
serve as the nucleic acid source for obtaining the coding region of an hsp gene. Nucleic acid 
sequences encoding HSPs can be isolated from vertebrate, mammalian, as well as primate 
sources, including humans. Clones derived from gmomic DNA may contain regulatory and 
intron DNA regions in addition to coding regions; clones derived from cDNA will contain 

2Q only exon sequences. Whatever the source, the hsp gene should be cloned into a suitable 
vector for propagation of the gene. 

Vectors based on E. coli are the most popular and versatile systems for high level 
expression of foreign proteins (Makrides, 1996, Microbiol Rev, 60:512-538). Non-limiting 
examples of regulatory regions that can be used for expression m E, coli may include but not 

25 Umited to foe, trp, Ipp, phoA, recA, tac, XP^ and phage T3 and T7 promoters (Makrides, 
1996, Microbiol Rev, 60:512-538), Non-limitmg examples of prokaryotic expression vectors 
may mclude flie Xgt vector series such as 1 (Huynh et al., 1984 in "DNA Cloning 
Techniques", Vol. I: A Practical Approach (D. Glover, ed.), pp. 49-78, IRL Press, Oxford), 
and the pET vector series (Studier et al., 1990, Methods EnzymoL, 185:60-89). However, a 

3Q potential drawback of a prokaryotic host-vector system is the inability to perform many of 
the post-translational processing events of mammalian cells. Thus, an eukaryotic host-vector 
system is preferred, a mammalian host-vector system is more preferred, and a human host- 
vector system is the most preferred. 

The regulatory regions necessary for transcription of an a2MR sequence, for 

35 example, can be provided by the expression vector. A translation initiation codon (ATG) 
may also be provided to express a nucleotide sequence encoding an a2M receptor that lacks 
an initiation codon. In a compatible host-construct system, cellular proteins required for 
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transcription, such as RNA polymerase and transcription factors, will bind to the regulatory 
regions on the expression construct to effect transcription of the a2MR sequence in the host 
organism. The precise nature of the regulatory regions needed for gene expression may vary 
from host cell to host cell. Generally, a promoter is required which is capable of binding 

5 RNA polymerase to initiate the transcription of an operably-associated nucleic acid 

sequence. Such regulatory regions may include those 5'-non-coding sequences involved with 
initiation of transcription and translation, such as the TATA box, the cap site, a CAAT box, 
and the like. The non-coding region 3' to the coding sequence may contain transcriptional 
termination regulatory sequences, such as terminators and polyadenylation sites. 

1 0 Both constitutive and inducible regulatory regions may be used for expression of the 

a2M receptor, HSP, a2M, or other a2MR ligand. It may be desirable to use inducible 
promoters when the conditions optimal for growth of the recombinant cells and the 
conditions for high level expression of the gene product are different. Examples of useful 
regulatory regions are provided in the next section below. 

1 5 For e:q)ression of the a2M receptor, HSP, a2M, or other a2MR ligand gene product 

in mammalian host cells, a variety of regulatory regions can be used, for example, flie SV40 
early and late promoters, the cytomegalovirus (CMV) immediate early promoter, and the 
Rous sarcoma virus long terminal repeat (RS V-LTR) promoter. Inducible promoters that 
may be useful in mammalian cells include but are not limited to those associated with the 

20 metallotiiionein n gene, mouse mammary tumor virus glucocorticoid responsive long 
terminal repeats (MMTV-LTR), the p-interferon gene, and the Hsp70 gene (Williams et al, 
1989, Cancer Res. 49:2735-42; Taylor et al, 1990, Mol. Cell BioL, 10:165-75). It may be 
advantageous to use heat shock promoters or stress promoters to drive expression of a2MR 
in recombinant host cells. 

25 The following animal regulatory regions, which exhibit tissue specificity and have 

been utilized in transgenic animals, can also be used in tumor cells of a particular tissue type: 
elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984, 
Cell 38:639-646; Omitz et al., 1986, Cold Spring Harbor Symp. Quant. BioL 50:399-409; 
MacDonald, 1987, Hepatology 7:425-515); insulin gene control region which is active in 

30 pancreatic beta cells (Hanahan, 1985, Nature 315:115-122), immunoglobulin gene control 
region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames 
et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), 
mouse mammary tumor virus control region which is active in testicular, breast, lymphoid 
and mast cells (Leder et al., 1986, Cell 45:485-495), albumin gene control region which is 

35 active in liver (Pinkert et al., 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et al., 1985, Mol. Cell. BioL 5:1639-1648; 
Hammer et al., 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is 
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active in the liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene 
control region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; 
Kollias et al., 1986, Cell 46:89-94; myelin basic protein gene control region which is active 
in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 48:703-712); myosin Ught 

5 chain-2 gene control region which is active in skeletal muscle (Sani, 1985, Nature 3 14:283- 
286), and gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et al., 1986, Science 234:1372-1378). 

The efficiency of expression of the a2M receptor in a host cell may be enhanced by 
the inclusion of appropriate transcription enhancer elements in the expression vector, such as 

10 those found in SV40 virus, Hepatitis B virus, cytomegalovirus, immunoglobulin genes, 
metallothionein, P-actin (see Bittner et al., 1987, Methods in EnzymoL 153:516-544; 
Gorman, 1990, Cuir. Op. in Biotechnol. 1:36-47). 

The expression vector may also contain sequences that permit maintenance and 
replication of the vector in more than one type of host cell, or integration of the vector into 

15 the host chromosome. Such sequences may include but are not limited to replication origins, 
autonomously replicating sequences (ARS), centromere DNA, and telomere DNA. It may 
also be advantageous to use shuttle vectors that can be replicated and maintained in at least 
two types of host cells. 

In addition, the expression vector may contain selectable or screenable marker genes 

20 for initially isolating or identifying host cells that contain DNA encoding an a2M receptor. 
For long term, high yield production of a2M receptor, stable expression in mammalian cells 
is preferred. A number of selection systems may be used for mammalian cells, including, but 
not limited, to the Herpes simplex virus thymidine kinase (Wigler et al., 1977, Cell 1 1 :223), 
hypoxanthine-guaninephosphoribosyltransferase (Szybalski and Szybalski, 1962, Proc. Natl. 

25 Acad. Sci. USA 48:2026), and adenine phosphoribosyltransferase (Lowy et al., 1980, Cell 
22:817) genes can be employed in tk\ hgprf or aprf cells, respectively. Also, antimetaboUte 
resistance can be used as the basis of selection for dihydrofolate reductase (dhfr), which 
confers resistance to methotrexate (Wigler et al., 1980, Natl. Acad. Sci. USA 77:3567; 
O'Hare et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to 

30 mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neomycin 
phosphotransferase (neo), which confers resistance to the aminoglycoside G-418 (Colbeire- 
Garapin et al., 1981, J. Mol. Biol. 150:1); and hygromycin phosphotransferase (hyg), which 
confers resistance to hygromycin (Santerre et al., 1984, Gene 30:147). Other selectable 
markers, such as but not limited to histidinol and Zeocin™ can also be used. 

35 In order to insert the DNA sequence encoding a2M receptor, HSP, a2M, or other 

a2MR ligand into the cloning site of a vector, DNA sequences with regulatory functions, 
such as promoters, must be attached to DNA sequences encoding the a2M receptor, HSP, 
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a2M, or other a2MR ligand, respectively. To do this, linkers or ad^ters providing the 
appropriate compatible restriction sites may be ligated to the ends of cDNA or synthetic 
DNA encoding an a2M receptor, by techniques well known in the art (Wu et al., 1987, 
Methods in Enzyraol 152:343-349). Cleavage with a restriction enzyme can be followed by 
5 modification to create blunt ends by digesting back or filling in single-stranded DNA termini 
before ligation. Alternatively, a desired restriction enzyme site can be introduced into a 
fi*agraent of DNA by amplification of the DNA by use of PGR with primers containing the 
desired restriction enzyme site. 



1 0 operably associated with regulatory regions can be directly introduced into appropriate host 
cells for expression and production of a2MR without fijrther cloning (see, for example, U.S. 
Patent No. 5,580,859). The expression constructs may also contain DNA sequences that 
facilitate integration of the a2M receptor sequence into the genome of the host cell, e.g., via 
homologous recombinatioiL In this instance, it is not necessary to employ an expression 

1 5 vector comprising a replication origin suitable for appropriate host cells in order to propagate 
and express the a2M receptor in the host cells. 

Expression constructs containing cloned nucleotide sequence encoding the a2M 
receptor, an HSP, a2M, or other a2MR ligand, can be introduced into the host ciell by a 
variety of techniques known in the art, including but not limited to, for prokaryotic cells, 

20 bacterial transformation (Hanahan, 1985, in DNA Cloning, A Practical Approach, 1:109- 
136), and for eukaryotic cells, calcium phosphate mediated transfection (Wigler et al., 1977, 
Cell 1 1 :223-232), liposome-mediated transfection (Schaefer-Ridder et al., 1982, Science 
215:166-168), electroporation (Wolff et al., 1987, ProcNatl Acad Sci 84:3344), and 
microinjection (Cappechi, 1980, Cell 22:479-488). 

25 For long term, high yield production of properly processed a2M receptor, HSP, a2M, 

or other a2MR ligand, stable expression in mammalian cells is preferred. Cell lines that 
stably express the a2M receptor, HSP, a2M, or other a2MR ligand or a2MR-peptide 
complexes may be engineered by using a vector that contains a selectable marker. By way of 
example but not limitation, following the introduction of the expression constructs, 

30 engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are 
switched to a selective media. The selectable marker in the expression construct confers 
resistance to the selection and optimally allows cells to stably integrate the expression 
construct into their chromosomes and to grow in culture and to be expanded into cell lines. 
Such cells can be cultured for a long period of time while the desired gene product is 

35 expressed continuously. 

The recombinant cells may be cultured under standard conditions of temperature, 
incubation time, optical density, and media composition. Alternatively, recombinant 



In one embodiment, an expression construct comprising an a2M receptor sequence 
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antigenic cells may be cultured under conditions emulating the nutritional and physiological 
requirements of the cancer cell or infected cell. However, conditions for growth of 
recombinant cells may be different from those for expression of the a2M receptor, HSPs, 
a2M, or other a2MR ligand, or antigenic peptide. 

5 

5.1.2 PEPTIDE SYNTHESIS 

An alternative to producing peptides and polypeptides comprising HSP, a2M 
receptor, a2M or other a2MR ligand sequences, by recombinant techniques is peptide 
synthesis. For example, a peptide corresponding to a portion of an HSP or an a2M peptide 

10 comprising the receptor-binding domain, which can be used as an antagonist in the 

therapeutic methods described herein, can be synthesized by use of a peptide synthesizer. 
Synthetic peptides corresponding to a2M receptor sequences useful for therapeutic methods 
described herein can also be produced synthetically. Conventional peptide synthesis may be 
used or other synthetic protocols well known in the art. 

15 For example, peptides having the amino acid sequence of the a2M receptor, an HSP, 

a2M, or other a2MR Ugand, or an analog, mutein, fragment, or derivative thereof, may be 
synthesized by soUd-phase peptide synthesis using procedures similar to those described by 
Menifield, 1963, J. Am. Chem. Soc, 85:2149. During synthesis, N-a-protected amino acids 
having protected side chains are added stepwise to a growing polypeptide chain linked by its 

20 C-terminal and to an insoluble polymeric support i.e., polystyrene beads. . The peptides are 
synthesized by linking an amino group of an N-a-deprotected amino acid to an a-carboxyl 
group of an N-a-protected amino acid that has been activated by reacting it with a reagent 
such as dicyclohexylcatbodiimide. The attachment of a free amino group to the activated 
carboxyl leads to peptide bond formation. The most commonly used N-a-pix)tecting groups 

25 include Boo which is acid labile and Fmoc which is base labile. Details of ^ropriate 
chemistries, resins, protecting groups, protected amino acids and reagents are well known in 
the art and so are not discussed in detail herein (See, Atherton, et al, 1989, Solid Phase 
Peptide Synthesis: A Practical Approach, IRL Press, and Bodanszky, 1993, Peptide 
Chemistry, A Practical Textbook, 2nd Ed., Springer-Veriag). 

30 Purification of the resulting a2M receptor, HSP, a2M, or other a2MR ligand peptides 

is accompHshed using conventional procedures, such as preparative HPLC using gel 
permeation, partition and/or ion exchange chromatography. The choice of appropriate 
matrices and buffers are well known in the art and so are not described in detail herein. 
In addition, analogs and derivatives of a2M receptor, HSP, a2M, or other a2MR 

35 ligand protein can be chemically synthesized. Furthermore, if desired, nonclassical amino 
acids or chemical amino acid analogs can be mtroduced as a substitution or addition into the 
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a2M receptor, HSP, a2M, or other a2MR ligand sequence. Non-classical amino acids 
include but are not limited to the D-isomers of the common amino acids, a-amino isobutyric 
acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, y-Abu, e-Ahx, 6-amino hexanoic acid, 
Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, 
5 hydroxyproline, sarcosine, citruUine, cysteic acid, t-butylglycme, t-butylalanine, 

phenylglycine, cyclohexylalanine, P-alanine, fluoro-amino acids, designer amino acids such 
as p-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
analogs in general. 

10 5.1.3 ANTroODffiS SPECIFIC FOR a2M RECEPTOR-HSP COMPLEXES 

Described herein are methods for the production of antibodies capable of specifically 
recognizing a2M receptor epitopes, HSP-a2M receptor complex epitopes or epitopes of 
conserved variants or peptide fiagments of the receptor or receptor complexes. Such 
antibodies are useful for therapeutic and diagnostic methods of the invention. 

15 Such antibodies may include, but are not limited to, polyclonal antibodies, 

monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies. 
Fab firagments, F(ab02 firagments, fi:agments produced by a Fab expression library, anti- 
idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. Such 
antibodies may be used, for example, in the detection of an a2M receptor or HSP-a2M 

20 receptor complex in an biological sample. Such antibodies may also be utilized in 

conjunction with, for example, compound screening schemes, as described below, in Section 
5.2, for the evaluation of the effect of test compounds on the interaction between HSPs and 
the a2M receptor. 

Anti-a2M receptor complex antibodies may additionally be used as a method for the 
25 inhibition of abnormal receptor product activity. Thus, such antibodies may, be utilized as 
part of treatment methods for HSP-a2M receptor related disorders, e,g.^ autoimmune 
disorders. 

For the production of antibodies against a2M receptor or receptor complexes, various 
host animals may be immunized by injection with an a2M receptor or HSP-a2M receptor 
30 complex, or a portion thereof. An antigenic portion of a2M receptor or HSP-a2M receptor 
complex can be readily predicted by algorithms known in the art. 

Host animals may include, but are not limited to rabbits, mice, and rats, to name but a 
few. Various adjuvants may be used to increase the immunological response, depending on 
the host species, including but not limited to Freund's (complete and incomplete), mineral 
35 gels such as aluminimi hydroxide, surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and 
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potentially useful human adjuvants such as BCG (bacille Cahnette-Guerin) and 
Corynebacterium parvum. 

Polyclonal antibodies are heterogeneous populations of antibody molecules derived 
from the sera of animals immunized with an antigen, such as an a2M receptor or HSP-a2M 

5 receptor complex, or an antigmic functional derivative thereof For the production of 
polyclonal antibodies, host animals such as those described above, may be immunized by 
injection with a2M receptor or HSP-a2M receptor complex, or portion thereof, supplemented 
with adjuvants as also described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a 

IQ particular antigen, may be obtained by any technique that provides for the production of 
antibody molecules by continuous cell hues in culture. These include, but are not limited to, 
the hybridoma technique of Kohler and Milstein, (1975, Nature 256, 495-497; and U.S. 
Patent No. 4,376,1 10), the human B-cell hybridoma technique (Kosbor et al, 1983, 
Immunology Today 4: 72; Cole et al, 1983, Proc. Natl. Acad. Sci. USA 80, 2026-2030), and 

15 the EBV-hybridoma technique (Cole et al, 1985, Monoclonal Antibodies And Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may be of any immunoglobulin 
class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma producing 
the mAb of this invention may be cultivated in vitro or in vivo. Production of high titers of 
mAbs in vivo makes this the presently preferred method of production. 

20 In addition, techniques developed for the production of "chimeric antibodies" 

(Morrison, et al, 1984, Proc. Nafl. Acad. Sci., 81: 6851-6855; Neuberger, et al, 1984, 
Nature 312: 604-608; Takeda, et al, 1985, Nature, 314: 452-454) by splicing the genes from 
a mouse antibody molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can be used. A chimeric 

25 antibody is a molecule in which different portions are derived from different animal species, 
such as those having a variable region derived from a murine mAb and a human 
immunoglobulin constant region (see, e.g,, Cabilly et al, U.S. Patent No. 4,816,567; and 
Boss et al, U.S. Patent No. 4,8 16397, which are mcorporated herein by reference in their 
entirety). 

30 In an additional embodiment of the invention, monoclonal antibodies can be 

produced in germ-free animals (see PCT International Pubhcation No. WO 89/12690, 
published December 12, 1989). According to the invention, human antibodies may be used 
and can be obtained by using human hybridomas (Cote et al, 1983, Proc. Natl. Acad. Sci. 
U.S.A. 80:2026-2030) or by transforming human B cells with EBV virus in vitro (Cole et al, 

35 1985. in Monoclonal Antibodies and Cancer Therapy. Alan R. Liss. pp. 77-96). Techniques 
developed for the production of "chimeric antibodies" (Morrison et aL, 1984, Proc. Natl. 
Acad. Sci. U.S.A. 81:6851-6855; Neuberger et aL, 1984, Nature 312:604-608; Takeda et al, 
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1985, Nature 314:452-454) by splicing the genes fix>m a mouse antibody molecule specific 
for an a2M receptor-HSP complex together with genes firom a human antibody molecule of 
appropriate biological activity can also be used; such antibodies are within the scope of this 
invention. 

5 Humanized antibodies are also provided (see U.S. Patent No. 5,225,539 by Winter). 

An immunoglobuin light or heavy chain variable region consists of a "firamework" region 
interrupted by three hypervariable regions, referred to as complementarity determining 
regions (CDRs). The extent of the fi-amework region and CDRs have been precisely defined 
(see, "Sequences of Proteins of Immunological Interest", Kabat, E. et al, U.S. Department of 

IQ Health and Human Services (1983). Briefly, humanized antibodies are antibody molecules 
fix)m non-human species having one or more CDRs fit)m the non-human species and a 
firamework region firom a human inmiunoglobulin molecule. Such CDRS-grafted antibodies 
have been successfiilly constructed against various antigens, for example, antibodies against 
IL-2 receptor as described in Queen et al, 1989, Proc. Nafl. Acad. Sci. USA 86:10029; 

15 antibodies against the cell surface receptor CAMPATH as described in Riechmann et aL, 
1988, Nature 332:323; antibodies against hepatitis B in Co a/., 1991, Proc. Natl. Acad. Sci. 
USA 88:2869; as well as against viral antigms of the respiratory syncytial virus in Tempest 
et al, 1991, Bio-Technology 9:267. Humanized antibodies are most preferred for 
therapeutic use in humans. 

20 Alternatively, techniques described for the production of single chain antibodies 

(U.S. Patent 4,946,778; Bird, 1988, Science 242: 423-426; Huston et al, 1988, Proc. Natl. 
Acad. Sci. USA 85: 5879-5883; and Ward et al, 1989, Nature 334: 544-546) can be adapted 
to produce single chain antibodies against a2M receptor or HSP-a2M receptor complexes, or 
portions thereof Single chain antibodies are formed by linking the heavy and hght chain 

25 fragments of the Fv region via an amino acid bridge, resulting in a single chain polypeptide. 
Antibody fragments that recognize specific epitopes may be generated by known 
techniques. For example, such fragments include but are not limited to: the F(ab')2 
fragments, which can be produced by pepsin digestion of the antibody molecule and the Fab 
fragments, which can be generated by reducing the disulfide bridges of the F(ab')2 fragments. 

30 Alternatively, Fab expression libraries may be constructed (Huse et al, 1989, Science, 246: 
1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the 
desired specificity. 

Antibodies to the a2M receptor can, in turn, be utilized to generate anti-idiotype 
antibodies that "mimic" the a2M rec^tor, using techniques well known to those skilled in 
35 the art (see, e.g,y Greenspan & Bona, 1993, FASEB J 7(5):437-444; and NissinofF, 1991, J. 
Immunol. 147(8):2429-2438). For example antibodies which bind to the a2M receptor BCD 
and competitively inhibit tiie binding of HSPs to the a2M receptor can be used to generate 
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anti-idiotypes that "mimic" the ECD and, therefore, bind and neutralize HSPs. Such 
neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can be used in therapeutic 
regimens to neutralize the native ligand and treat HSP-a2M receptor-related disorders, such 
as immunological disorders, proHferative disorders, and infectious diseases. 

5 Alternatively, antibodies to the a2M receptor that can act as agonists of the a2M 

receptor activity can be generated. Such antibodies will bind to the a2M receptor and 
activate the signal transducing activity of the receptor. In addition, antibodies that act as 
antagonist of the a2M receptor activity, i.e. inhibit the activation of the a2M receptor would 
be particularly useful for treating autoimmune disorders, proliferative disord^, such as 

IQ cancer, and infectious diseases. Methods for assaying for such agonists and antagonists are 
described in detail in Section 5.2, below. 



15 



20 



25 



30 



35 



5.2 ASSAYS FOR THE IDENTIFICATION OF COMPOXJNDS THAT INTERACT 
WITH THE a2M RECEPTOR 

The present invention is based on the discovery that the a2M receptor recognizes 
HSP-antigenic peptide complexes and transports them within the cell for the purpose of 
presenting such antigenic molecules to cells of the immune system and eliciting an immune 
response. Thus, methods for identifying compounds that interact with the receptor, or 
enhance or block the function of the receptor, are included in the invention. The present 
invention provides in vitro and in vivo assay systems, described in the subsections below, 
which can be used to identify compounds or compositions that interact with the a2M 
receptor, or modulate the activity of the a2M receptor and its interaction with HSPs or HSP- 
peptide complexes. 

Hie invention provides screening methodologies useful in the identification of small 
molecules, proteins and other compounds which interact with the a2M receptor, or modulate 
the interaction of HSPs with the a2M receptor. Such compounds may bind the a2M 
receptor genes or gene products with differing affinities, and may serve as regulators of 
receptor activity in vivo wifli useful therapeutic 25)plications in modulating the immune 
response. For example, certain compounds that inhibit receptor function may be used in 
patients to downregulate destructive immune responses which are caused by ceUular release 
ofHSPs. 

Methods to screen potential agents for their ability to interact with the a2M receptor, 
or modulate a2M receptor expression and activity can be designed based on the inventor's 
discovery of the receptor and its role in HSP or HSP-peptide complex binding and 
recognition. a2M receptor protein, nucleic acids, and derivatives can be used in screening 
assays to detect molecules that specifically bind to HSP proteins, derivatives, or nucleic 



-28- 



wo 01/92474 



PCT/USOl/18041 



acids, and thus have potential use as agonists or antagonists of the a2M receptor, to modulate 
the immune response. In a preferred embodiment, such assays are performed to screen for 
molecules with potential utility as anti-autoimmuue disease, anti-cancer and anti-infective 
drugs (such as anti-viral drugs and antibiotic drugs), or lead compounds for drug 

5 development. For example, recombinant cells expressing a2M receptor nucleic acids can be 
used to recombinantly produce a2M receptor in these assays, to screen for molecules that 
interfere with the binding of HSPs to the a2M receptor. Similar methods can be used to 
screen for molecules that bind to the a2M receptor derivatives or nucleic acids. Methods that 
can be used to cany out the foregoing are commonly known in the art. 

JO Compounds capable of specifically binding the a2M receptor can be usefiil for 

immunotherapy. In one embodiment, an assay is disclosed for identifying compounds that 
specifically bind the a2M receptor comprising: (a) contacting an a2M receptor with one or 
more test compounds under conditions conducive to binding; and (b) identifying one or more 
test compounds which specifically bind to the a2M receptor, such that a compoimd capable 

1 5 of specifically binding the a2M receptor is identified as a compound usefiil for 
inmiunotherapy. 

Another method encompassed by the invention for identifying a compound usefiil for 
immunotherapy involves identifying a compound which modulates the binding of an a2M 
receptor ligand to the a2M receptor. The term "a2M receptor ligand" as used herem, refers 

20 to an molecule capable ofbinding to the a2M receptor. Such a2M receptor ligands include, 
but are not limited to, a2M and a2M complexes, heat shock proteins and heat shock protein 
complexes, lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), 
urokinase-type plasminogen activator (uPA), and exotoxins. Such ligands are typically 
endocytosed by cell upon binding to the a2M receptor. The method con:q)rises the steps of: 

25 (a) contacting an a2M receptor with an a2M receptor ligand, or fragment, or analog, 
derivative or mimetic thereof, in the presence of one or inore test compound; and (b) 
measuring the amoimt of a2M receptor hgand, or fragment, analog, derivative or mimetic 
thereof, bound to the a2M receptor, such that if the amount of bound a2M receptor ligand- 
measured in (b) differs from the amount of bound a2M receptor measured in the absence of 

30 the test compound, then a compound useful for immunotherapy that modulates the binding of 
an a2M receptor ligand to the a2M receptor is identified. 

In another embodiment, a method for identifying a compound useful for 
immimotherapy which modulates the interaction between the a2M receptor and an a2M 
receptor ligand is provided by the invention. This method comprises the steps of: (a) 

35 contacting an a2M receptor with one or more test compounds; and (b) measuring the level of 
a2M recq>tor activity or expression, such that if the level of activity or expression measured 
in (b) differs from the level of a2M receptor activity in the absence of one or more test 
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compounds, then a compound that modulates the interaction between the a2M receptor and 
an a2M receptor ligand is identified. 

In another embodiment, an assay for identifying a compound that modulates an HSP- 
ct2M receptor-mediated process is disclosed. This assay comprises: (a) contacting a test 

5 compound with an HSP and an a2M receptor; and (b) measuring the level of a2M receptor 
activity or expression, such that if the level of activity or expression measured in (b) differs * 
fi-om the level of a2M receptor activity in the absence of the test compound, then a 
compound that modulates an HSP-a2M receptor-mediated process is identified. In another 
embodiment, in which the compound identified is an antagonist which interferes with the 

IQ interaction of the HSP with the a2M receptor, the method fiutiier comprises the step of 
determining whether the level interferes with the interaction of the HSP and the a2M 
receptor. 

In another embodiment, a cell-based method for identifying a compound that 
modulates an HSP-a2M receptor-mediated process is described. This method comprises the 

1 5 following steps: (a) contacting a test compound with a heat shock protein and an a2M 
receptor-expressmg cell; and (b) measuring the level of a2M receptor activity or expression 
in the cell, such that if the level of activity or expression measured in (b) differs from the 
level of a2M receptor activity in the absence of the test compound, then a compound that 
modulates an HSP-a2M receptor-mediated process is identified. 

20 hi another embodiment, a receptor-ligand binding assay for identifying a compoxmd 

that interacts with a2MR, or modulates the binding of an HSP to a2MR. One such method 
comprises: (a) contacting an HSP with an a2M receptor, or firagment, or analog, derivative 
or mimetic thereof, in the presence of a test compound; and (b) measuring the amount of heat 
shock protein bound to the a2M receptor, or firagment, analog, derivative or mimetic thereof, 

25 such that if the amount of bound heat shock protein measured in (b) differs firom the amount 
of bound heat shock protein measured in the absence of the test compound, then a compound 
that modulates the binding of an HSP to the a2M receptor is identified. 

In another embodiment, a method for identifying a compound that modiilates antigen 
presentation by a2MR-expressing cells is provided by the invention. In one embodiment, 

30 such a method comprises: (a) adding one or more test compounds to a mixture of a2MR- 
expressing cells and a complex comprising an a2MR ligand and an antigenic molecule, 
under conditions conducive to a2MR-mediated endocytosis; (2) measuring the level of 
stimulation of antigen-specific cytotoxic T cells by the a2MR-expressing cells, such that if 
the level measured in (b) differs fi'om the level of said stimulation in the absence of the one 

35 or more test compounds, then a compound that modulates antigen presentation by a2MR- 
expressing cells is identified. In another embodiment, a test compound is added to a mixture 
of a2MR-expressing cells and a complex consisting essentially of an HSP noncovalently 
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associated with an antigenic molecule, under conditions conducive to a2MR-mediated 
endocytosis; and the level of stimulation of antigen-specific cytotoxic T cells by the a2MR- 
expressing cells is measured, such that if the level measiured differs from the level of said 
stimulation in the absence of the test compound, then a compound that modulates HSP- 

5 mediated antigen presentation by a2MR-expressing cells is identified. 

The assays of the present invention may be first optimized on a small scale (ie.y in 
test tubes), and then scaled up for high-throughput assays. In various embodiments, the in 
vitro screening assays of the present invention may be performed using purified components 
or cell lysates. In other embodiments, the screening assays may be carried out in intact cells 

jQ in culture and in animal models. In accordance with the present invention, test compoimds 
which are shown to modulate the activity of the a2M receptor as described herein in vitro, 
will fiirther be assayed in vivo, including cultured cells and animal models to detemiine if the 
test compound has the similar effects in vivo and to determine the effects of the test 
compoimd on antigen presentation, cytokine release, intracellular Ca^ release, T-cell 
, 1 5 cytotoxicity, tumor progression, the accumulation or degradation of positive and negative 
regulators, cellular proliferation, etc. 

5.2. 1 a2M RECEPTOR-LIGAND BINDING ASSAYS 

The screening assays, described herein, can be used to identify compoimds and 
20 compositions, including peptides and organic, non-protein molecules that interact with the 
a2M receptor, or that modulate the interaction between HSPs and the a2M receptor. 
Recombinant, synthetic, and otherwise exogenous compounds may have binding capacity 
and, therefore, may be candidates for pharmaceutical agents. Alternatively, the proteins and 
compoimds include endogenous cellular components which interact with the identified genes 
25 and proteins in vivo. Such endogenous components may provide new targets for 
pharmaceutical and therapeutic interventions. 

Thus, in a preferred embodiment, both naturally occurring and/or synthetic 
compounds libraries of small molecules or peptides), may be screened for interacting 
with a2M receptor and/or modulating a2M receptor activity. In another series of 
30 embodiments, cell lysates or tissue homogenates may be screened for proteins or other 

compounds which bind to one of the normal or mutant a2M receptor genes and a2M receptor 
polypeptides. 

The screening assays described herein may be used to identify small molecules, 
peptides or proteins, or derivatives, analogs and fragments thereof, that interact with and/or 
35 modulate the interaction of HSPs with the a2M receptor. Such compounds may be used as 
agonists or antagonists of the uptake of a2M receptor Ugands, such as HSPs and HSP 
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complexes, by the cell surface receptor. For example, compounds that modulate the a2M 
receptor-ligand interaction include, but are not limited to, compounds that bind to the a2M 
receptor, thereby either inhibiting (antagonists) or enhancing (agonists) the binding of 
ligands, such as HSPs and HSP complexes, to the receptor, as well as compounds that bind to 

5 the ligand, such as for example, HSPs, thereby preventing or enhancing binding of Ugand to 
the receptor. Compounds that affect a2M receptor gene activity (by affecting a2M receptor 
gene expression, including molecules, e.^., proteins or small organic molecules, that affect 
transcription or interfere with splicing events so that expression of the full length or truncated 
forms of a2M receptor can be modulated) can also be identified in the screens of the 

I Q invention. Further, it should be noted that the assays described can also identify compounds 
that modulate a2M receptor ligand, for example HSP, uptake by a2M receptor (e.g., 
compounds which affect downstream signaling in the a2M receptor signal transduction 
pathway). The identification and use of such compounds which affect signaling events 
downstream of the a2M receptor and thus modulate effects of the receptor on the immune 

25 response are within the scope of the invention. 

Compounds that affect the a2M receptor gene activity (by affecting the a2M receptor 
gene e}q)ression, including molecules, e.g., proteins or small organic molecules, that affect 
transcription or interfere with splicing events so that expression of the fiill length or the 
truncated fonn of the a2M receptor can be modulated) can also be identified in the screens of 

20 the invention. However, it should be noted that the assays described can also identify 
compounds that modulate the a2M receptor signal transduction (e,g,, compounds which 
affect downstream signaling events, such as inhibitors or enhancers of endocytic activity 
which is activated by ligand binding to the a2M receptor). The identification and use of such 
compounds which affect signaling events downstream of the a2M receptor and thus 

25 modulate effects of the a2M receptor on the allergenic response are within the scope of the 
invention. 

The screening assays described herein are designed to detect compounds that 
modulate, Le. interfere with or enhance, ligand-receptor interactions, including HSP-a2M 
receptor interactions. As described in detail below, such assays are functional assays, such as 

30 binding assays, that can be adapted to a high-throughput screening methodologies. 

Binding assays can be used to identify compounds that modulate the interaction 
between ligands, for example, HSPs, and the a2M receptor. In one aspect of the invention 
the screens may be designed to identify compounds that disrupt the iateraction between the 
a2M receptor and a ligand, such as, for example, HSPs or peptides derived from an HSP, 

35 a2M, or another a2M receptor ligand. Such compounds will be useful as lead compounds 
for antagonists of HSP-a2M receptor-related disord^ and conditions, such as immune 
disorders, proliferative disorders, and infectious diseases. 
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Binding assays may be perfonned either as direct binding assays or as competition 
binding assays. In a direct binding assay, a test compomid is tested for binding either to the 
a2M receptor or to an a2M receptor ligand, such as an HSP. Then, in a second step, the test 
compound is tested for its ability to modulate the ligand-a2M receptor interaction. 

5 Competition binding assays, on the other hand, assess the ability of a test compound to 
compete with a ligand, Le. an HSP, for binding to the a2M receptor. 

In a direct binding assay, either the ligand and/or the a2M receptor is contacted with a 
test compound under conditions that allow binding of the test compound to the ligand or the 
receptor. The binding may take place in solution or on a solid surface. Preferably, the test 

jQ compound is previously labeled for detection. Any detectable compound may be used for 
labeling, such as but not limited to, a luminescent, fluorescent, or radioactive isotope or 
group containing same, or a nonisotopic label, such as an enzyme or dye. After a period of 
incubation sufficient for binding to take place, the reaction is exposed to conditions and 
manipulations that remove excess or non-specifically bound test compound. Typically, it 

1 5 involves washing with an appropriate buffer. Finally, the presence of a ligand-test 
compound (e.^., HSP-test compound) or a the a2M receptor-test compound complex is 
detected. 

hi a competition binding assay, test compounds are assayed for their ability to disrupt 
or enhance the binding of the ligand {e.g., HSP) to the a2M receptor. Labeled ligand (e.g., 
20 HSP) may be mixed with the a2M receptor or fragment or derivative thereof, and placed 
under conditions in which the interaction between them would normally occur, with and 
without the addition of the test compound. The amount of labeled ligand (e.g., HSP) that 
binds the a2M receptor may be compared to the amount bound in the presence or absence of 
test compound. 

25 In a preferred embodiment, to facilitate complex formation and detection, the binding 

assay is carried out with one or more components immoblilized on a sohd surface. In 
various embodiments, the solid support could be, but is not restricted to, polycarbonate, 
polystyrene, polypropylene, polyethlene, glass, nitrocellulose, dextran, nylon, 
polyacrylamide and agarose. The support configuration can include beads, membranes, 

30 microparticles, the interior surface of a reaction vessel such as a microtiter plate, test tube or 
other reaction vessel. The immobilization of the a2M receptor, or other component, can be 
achieved through covalent or non-covalent attachments. In one embodiment, the attachment 
may be indirect, i.e. through an attached antibody. In another embodiment, the a2M receptor 
and negative controls are tagged with an epitope, such as glutathione S-transferase (GST) so 

35 that the attachment to the solid surface can be mediated by a commercially available 
antibody such as anti-GST (Santa Cruz Biotechnology). 
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For example, such an affinity binding assay may be performed using a the a2M 
receptor which is immobilized to a solid support. Typically, the non-mobilized component 
of the binding reaction, in this case either Ugand (e.g., HSP) or the test compound, is labeled 
to enable detection. A variety of labeling methods are available and may be used, such as 

5 luminescent, cbromophore, fluorescent, or radioactive isotope or group containing same, and 
nonisotopic labels, such as enzymes or dyes. In a preferred embodiment, the test compound 
is labeled with a fluorophore such as fluorescein isothiocyanate (FITC, available from Sigma 
Chemicals, St. Louis). 

The labeled test compounds, or ligand (e.g., HSP) plus test compounds, are then 

2Q allowed to contact with the solid support, under conditions that allow specific binding to 
occur. After the binding reaction has taken place, unbound and non-specifically bound test 
compounds are separated by means of washing the surface. Attachment of the binding 
partner to the solid phase can be accomplished in various ways known to those skilled in the 
art, including but not limited to chemical cross-linking, non-specific adhesion to a plastic 

]^ 5 surface^ interaction with an antibody attached to the solid phase, interaction between a ligand 
attached to the binding partner (such as biotin) and a ligand-binding protein (such as avidin 
or streptavidin) attached to the soUd phase, and so on. 

Finally, the label remaining on the solid surface may be detected by any detection 
method known in the art. For example, if the test compound is labeled with a fluorophore, a 

20 fluorimeter may be used to detect complexes. 

Preferably, the a2M receptor is added to binding assays in the form of intact cells that 
express the a2M receptor, or isolated membranes containing the a2M receptor. Thus, direct 
binding to the a2M receptor or the abiUty of a test compound to modulate a ligand-a2M 
receptor complex (eg., HSP- a2M receptor complex) may be assayed in intact cells in 

25 culture or in animal models in the presence and absence ofthe test compound. A labeled 
Ugand (e.g., HSP) may be mixed with cells that express the a2M receptor, or to crude 
extracts obtained fit>m such cells, and the test compoimd may be added. Isolated membranes 
may be used to identify compoimds that interact with the a2M receptor. For example, in a 
typical experiment using isolated membranes, cells may be genetically engineered to express 

30 the a2M receptor. Membranes can be harvested by standard techniques and used in an in 
vitro binding assay. Labeled ligand (eg., '^I-labeled HSP) is bound to the membranes and 
assayed for specific activity; specific binding is determined by comparison with binding 
assays performed in the presence of excess unlabeled (cold) Ugand. Alternatively, soluble 
a2M receptor may be recombinantly expressed and utilized in non-cell based assays to 

35 identify compounds that bind to the a2M receptor. The recombinantly expressed a2M 
receptor polypeptides or fusion proteins containing the extracellular domain (BCD) of the 
a2M receptor, or one or more subdomains thereof, can be used in the non-ceU based 
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screening assays. Alternatively, peptides corresponding to one or more of the CDs of the 
a2M receptor, or fusion proteins containing one or more of the CDs of the a2M receptor can 
be used in non-cell based assay systems to identify compounds that bind to the cytoplasmic 
portion of the a2M receptor; such compoimds may be useful to modulate the signal 
5 transduction pathway of the a2M receptor. In non-cell based assays the recombinantly 

expressed the a2M receptor is attached to a solid substrate such as a test tube, microtiter well 
or a column, by means well known to those in the art (see Ausubel et aL, supra). The test 
compounds are then assayed for then- ability to bind to the a2M receptor. 



IQ labeled component is allowed to interact with its binding partner(s) in solution. If the size 
differences between the labeled component and its binding partner(s) permit such a 
separation, the separation can be achieved by passing the products of the binding reaction 
through an ultrafilter whose pores allow passage of unbound labeled component but not of its 
binding partner(s) or of labeled component bound to its partner(s). Separation can also be 

25 achieved using any reagent capable of capturing a binding partner of the labeled component 
from solution, such as an antibody against the binding partner, a ligand-binding protein 
which can interact with a ligand previously attached to the binding partner, and so on. 

In a one embodiment, for example, a phage library can be screened by passing phage 
from a continuous phage display Ubrary through a column containing purified a2M receptor, 

20 or derivative, analog, fragment, or domain, thereof, linked to a solid phase, such as plastic 
beads. By altering the stringency of the washing buffer, it is possible to enrich for phage that 
express peptides with high afiSnity for the a2M receptor. Phage isolated from the column 
can be cloned and the affinities of the short peptides can be measured directly. Sequences for 
more than one oUgonucleotide ean be combined to test for even higher affinity binding to the 

25 a2M receptor. Knowing which amino acid sequences confer the strongest binding to the 
a2M receptor, computer models can be used to identify the molecular contacts between the 
a2M receptor and the test compoimd. This will allow the design of non-protein compounds 
which mimic those contacts. Such a compound may have the same activity of the peptide 
and can be used therapeutically, having the advantage of being efficient and less costly to 

3Q produce. 

In another specific embodiment of this aspect of the invention, the solid support is 
membranes containing the a2M receptor attached to a microtiter dish. Test compounds, for 
example, cells that express library members are cultivated under conditions that allow 
expression of the library members in the microtiter dish. Library members that bind to the 
35 protein (or nucleic acid or derivative) are harvested. Such methods, are described by way of 
example in Pannley and Smith, 1988, Gene 73:305-318; Fowlkes et al, 1992, 



Alternatively, the bindiag reaction may be carried out in solution. In this assay, the 
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BioTechniques 13:422-427; PCT Publication No. WO 94/18318; and in references cited 
hereinabove. 

In another embodiment of the present invention, interactions between the a2M 
receptor or ligand (e.g., HSP) and a test compound may be assayed in vitro. Known or 

5 unknown molecules are assayed for specific binding to the a2M receptor nucleic acids, 
proteins, or derivatives imder conditions conducive to binding, and then molecules that 
specifically bind to the a2M receptor are identified. The two components can be measured 
in a variety of ways. One approach is to label one of the components with an easily 
detectable label, place it together with a test component(s) under conditions that allow 

JO binding to occur, perform a separation step which separates bound labeled component firom 
unboimd labeled component, and then measure the amount of boxmd component. In one 
embodiment, the a2M receptor can be labeled and added to a test agent, using conditions that 
allow binding to occur. Binding of the test agent can be determined using polyacrylamide 
gel analysis to compare complexes formed in the presence and absence of die test agent. 

15 In yet another embodiment, binding of ligand (e.g., HSP) to the a2M receptor may be 

assayed in intact cells in animal models. A labeled ligand (e.g., HSP) may be administered 
directly to an animal, with and without a test compound. Uptake of the ligand (e.g., HSP) 
may be measured in the presence and the absence of test compound. For these assays, host 
cells to which the test compound is added may be genetically engineered to express the a2M 

20 receptor and/or ligand (e.g., HSP), which may be transient, induced or constitutive, or stable. 
For the puiposes of the screening methods of the present invention, a wide variety of host 
cells may be used including, but not limited to, tissue culture cells, mammalian cells, yeast 
cells, and bacteria. MammaUan cells such as macrophages or other cells that express the 
a2M receptor, cells of the monocytic lineage, liver parenchymal cells, fibroblasts, 

25 keratinocytes, neuronal cells, and placental syncytiotrophoblasts, may be a preferred cell type 
in which to carry out the assays of the present invention. Bacteria and yeast are relatively 
easy to cultivate but process proteins differently than mammaUan cells. 

5.2.2 a2M RECEPTOR ACTIVITY ASSAYS 

30 After identification of a test compound that interacts with, or modulates the 

interaction of a Ugand (e.g., HSP) with a2MR, the test compoimd can be further 
characterized to measure its effect on a2MR activity and the hgand-a2MR endocytic 
signaling pathway. For example, the test compound may be characterized by testing its 
efTect on ligand (e.g., HSP) /a2MR cellular activity in vivo. Such assays include downstream 

35 signaling assays, antigen presentation assays, assays for antigen-specific activation of 
cytotoxic T cells, and the like. 
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In various embodiments, a candidate compound identified in a primary assay may be 
tested for its effect on innate cx2MR signaling activity. For example, downstream signaling 
effects of a2Mjr€^eptor activation which can be assayed inc^jjde, but are not limited to: 
enhanced locorftLotiJa^and dixemotajas of macrophages (Forrester ei'alf 19^, Intmunology 

5 50: 251-259), down regulation of proteinase synthesis, and elevation of intracellular calcium, 
inositol phosphates and cyclic AMP (Misra et al, 1993, Biochem. J., 290:885-891). Other 
innate immune responses that can be tested are release of cytokines (i.e., IL-12, ILlp, 
GMCSF, and TNFa). Thus, as secondary assays, any identified candidate compoimd can be 
tested for changes in such activities in the presence and absence. 

20 For example, in one embodiment, a chemotaxis assay can be used to fiorther 

characterize a candidate identified by a primary screening assay. It is known that a2M 
modified by protease interaction can induce directional migration of cells towards their 
ligand. A number of techniques can be used to test chemotactic migration in vitro (see, eg., 
Leonard et aL, 1995, "Measurement of a and B Chemokines", in Current Protocols in 

15 Immunology, 6.12.1-6.12.28, Ed. Coligan et al, John Wiley & Sons, Inc. 1995), For 

example, in one embodiment, a candidate compound can be tested for its ability to modulate 
the ability of a2MR to induce migration of cells that express the receptor using a chemokine 
gradient in a multiwell Boyden chemotaxis chamber. In a specific example of this method, a 
serial dilution of a ligand (eg., an HSP) / a2MR antagonist or agonist test compound 

2Q identified in the primary screen is placed in the bottom wells of the Boyden chemotaxis 
chamber. A constant amount of ligand is also added to the dilution series. As a control, at 
least one aliquot contains only ligand (e.g. , HSP). The contribution of the antagonist or 
agonist compound to the chemotactic activity of a2MR is measured by comparing number of 
migratmg cells on the lower surface of the membrane filtCT of the aliquots containing only 

25 ligand (eg., HSP), with the number of cells in aliquots containing test compound and ligand 
{e.g., HSP). If addition of the test compound to the ligand {e.g., HSP) solution results in a 
decrease in the number of cells detected the membrane relative to the number of cells 
detected using a solution containing only ligand (eg., HSP), then an antagonist of ligand 
(eg., HSP) induction of chemotactic activity of a2MR-expressing cells is identified. 

30 Elevation in intracellular ionized calcium concentration ([Ca^^^) is also an indicator 

of a2MR activation (Misra et al, 1993, supra). Thus, in another embodiment, calcium flux 
assays can be used as secondary screens to fiurther characterize modulators of ligand-a2MR 
interactions. Intmcellular calcium ion concentration can be measured in cells that express the 
a2M receptor in the presence of the ligand, in the presence and the absence of a test 

33 compound. For example, calcium mobilization can be detected and measured by flow 
cytometry, by labeling with fluorescent dyes that are trapped intracellularly A fluorescent 
dye such as Indo-1 exhibits a change in emission spectrum upon binding calcium, the ratio of 
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fluorescence produced by the calcium-bound dye to that produce by the unbound dye may be 
used to estimate the intracellular calcium concentration. In a specific embodiment, cells are 
incubated in a cuvette in media containing Indo-1 at 37°C and are excited, and fluorescence 
is measured using a fluorimeter (Photon Technology Corporation, International). The hgand 

5 is added at a specific time point, in the presence and the absence of a test compound, EGTA 
is added to the cuvette to release and chelate total calcium, and the response is measured. 
Binding of ligand results in increased intracellular Ca^^ concentration in cells that express 
a2MR. An agonist results in a relative increased intracellular Ca^^ concentration, whereas an 
antagonist resxilts in a relative decreased intracellular Ca^"^ concentration 

10 In other embodiments, antigen-specific response assays may be used to detect the 

effect of a candidate compound on presentation of antigenic molecule by an a2MR ligand, 
for example an HSP or HSP complex. For example, an antigen presentation assay may be 
performed to determine the effect of a compound in vivo on the uptake of complexes capable 
of interacting with the a2M receptor, e.g., HSP-antigenic molecule complexes, by cells 

1 5 expressing the a2M receptor. Such re-presentation assays are known in the art, and have 
been described previously (Suto and Srivastava, 1995, Science 269:1585-1588). For 
example, in one embodiment, antigen presenting cells, such as a macrophage cell line (e.^., 
RAW264.7), are mixed with antigen-specific T cells in media, using approximately 10,000 
cells of each type at approximately a 1 :1 ratio. Complexes of HSP (10 p.g/ml) and a peptide 

20 antigen, as well as test compound, is added to the cells and the culture is incubated for 
sqpproximately 20 hours. Stimulation of T cells may then be measured in the presence and 
absence of test compound. 

In another embodiment, antigen-specific T cell stimulation may be assayed. In one 
embodiment an IFN-y release assay may be used. After washing, cells are fixed, 

25 pemieabiHzed, and reacted with dye-labeled antibodies reactive with human IFN-7 (PE- anti- 
IFN-y), Samples are analyzed by flow cytometry using standard techmques. Alternatively, a 
filter immxmoassay, ELISA (enzyme linked immunosorbent assay), or enzyme-linked 
immunospot assay (ELISPOT) assay, may be used to detect specific cytokines produced by 
an activated T cell. In one embodiment, for example, a nitrocellulose-backed microtiter plate 

30 is coated with a purified cytokine-specific primary antibody, i.e., anti-IFN-y, and the plate is 
blocked to avoid background due to nonspecific binding of other proteins. A sample of APC 
cells stimulated with antigen is diluted onto the wells of the microtiter plate. A labeled, e.g., 
biotin-labeled, secondary anti-cytokine antibody is added. The antibody cytokine complex 
can then be detected, i.e., by enzyme-conjugated streptavidin - cytoldne-secreting cells will 

35 appear as "spots" by visual, microscopic, or electronic detection methods. In another 
embodiment, 'tetramer staining" assay (Altman et al, 1996, Science 274: 94-96) may be 
used to identify antigen-specific T-cells. For example, an MHC molecule containing a 
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specific peptide antigen, such as a tumor-specific antigen, is multimerized to make soluble 
peptide tetramers and labeled, for example, by complexing to streptavidin. The MHC- 
peptide antigen complex is then mixed with a population of stimulated T cells. Biotin is then 
used to stain T cells which recognize and bind to the MHC-antigen complex. 

5.2.3 COMPOUNDS THAT CAN BE SCREENED IN ACCORDANCE WITH 
THE INVENTION 

The screening assays described herein may be used to identify small molecules, 
peptides or proteins, or derivatives, analogs and fragments thereof, that interact with, or 
modulate the interaction of a ligand HSP) with the a2M receptor. The compounds 
which may be screened in accordance with the invention include, but are not limited to small 
molecules, peptides, antibodies and fragments thereof, and other organic compounds (e.g., 
peptidomimetics) that bind to the ECD of the a2M receptor and either inhibit the activity 
triggered by the natural ligand (z.e., antagonists) or mimic the activity triggered by the 
natural ligand (z.e., agonists), as well as small molecules, peptides, antibodies or fragments 
thereof, and other organic compounds. In one embodiment, such compounds include 
sequences of the a2M receptor, such as the ECD of the a2M receptor (or a portion thereof), 
which can bind to and "neutralize" natural ligands, such as HSPs, a2M, LDL, etc. In 
another embodiment, such compounds include ligand sequences, such as HSP sequences 
and/or a2M sequences, which can bind to the active site of the a2M recq>tor, and block its 
activity. 

Compounds that may be used for screening include, but are not limited to, peptides 
such as, for example, soluble peptides, including but not limited to members of random 
peptide libraries; (see, e.g.^ Lam et al, 1991, Nature 354:82-84; Houghten et al, 1991, 
Nature 354:84-86), and combinatorial chemistry-derived molecular library made of D- and/or 
Lr configuration amino acids, phosphopeptides (including, but not limited to, members of 
random or partially degenerate, directed phosphopeptide libraries; see, e.g., Songyang et al, 
1993, Cell 72:767-778), antibodies (including, but not limited to, polyclonal, monoclonal, 
humanized, anti-idiotypic, chimeric or single chain antibodies, and FAb, F(ab')2 and FAb 
expression library fragments, and epitope-binding fragmmts thereof), and small organic or 
inorganic molecules. 

In one embodiment of the present invention, peptide libraries may be used as a source 
of test compounds that can be used to screen for modulators of a2MR interactions, such as 
HSP- a2M receptor. Diversity libraries, such as random or combinatorial peptide or 
nonpeptide libraries can be screened for molecules that specifically bind to the a2M receptor. 
Many libraries are known in the art that can be used, e.g.y chemically synthesized Ubraries, 
recombinant (e.g., phage display Ubraries), and in vitro translation-based libraries. 
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Examples of chemically synthesized Ubraries are described in Fodor et al, 1991, 
Science 251:767-773; Houghten et al, 1991, Nature 354:84-86; Lam et al, 1991, 
Nature 354:82-84; Medynski, 1994, Bio/Technology 12:709-710; Gallop et al, 1994, J. 
Medicinal Chemistry 37(9): 1233-1251; Ohlmeyer e/o/., 1993, Proc. Natl. Acad. Sci. USA 
5 90:10922-10926; Eib e/ a/., 1994, Proc. Natl. Acad. Sci.USA91:11422-ll426; Houghten 
al, 1992, Biotechniques 13:412; Jayawickreme et al, 1994, Proc. Natl. Acad. Sci. USA 
91:1614-1618; Sahnon etal, 1993, Proc. Natl. Acad. Sci. USA 90:11708-11712; PCT 
Publication No. WO 93/20242; and Brenner and Lemer, 1992, Proc. Natl. Acad. Sci. USA 
89:5381-5383. 

Examples of phage display libraries are described in Scott & Smith, 1990, 
Sciraice 249:386-390; Devlin et al, 1990, Science, 249:404-406; Christian et al., 1992, J. 
Mol. Biol. 227:711-718; Lenstra, 1992, J. ImmunoL Meth. 152:149-157; Kay et al, 1993, 
Gene 128:59-65; and PCT PubHcation No. WO 94/18318 dated August 18, 1994. 

By way of examples of noiq)^tide libraries, abeaazodiazq>ine library (see e.g., Bunin 
et al, 1994, Proc. Natl. Acad. Sci. USA 91:4708-4712) can be adapted for use. Peptoid 
Ubraries (Simon a al, 1992, Proc. Natl. Acad. Sci. USA 89:9367-9371) can also be used. 
Another example of a library that can be used, in whidi the amide functionalities in peptides 
have been permethylated to generate a chemically transformed combinatorial library, is 
described by Ostresh et al (1994, Proc. Natl. Acad. Sci. USA 91 : 1 1 138-1 1 142). 

20 Screening the Ubraries can be accomphshed by any of a variety of commonly known 

methods. See, e.g., the following references, which disclose screening of peptide libraries: 
Parmley & Smith, 1989, Adv. Exp. Med. Biol. 251:215-218; Scott & Smith, 1990, 
Science 249:386-390; Fowlkes et al, 1992; BioTechniques 13:422-427; Oldenburg et al, 
1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu etal, 1994, Cell 76:933-945; Staudt et 

25 al, 1988, Science 241 :577-580; Bock et al, 1992, Nature 355:564-566; Tuerk et al, 1992, 
Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et al, 1992, Nature 355:850-852; U.S. 
Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent No. 5,198,346, all to 
Ladner et al ; Rebar & Pabo, 1993, Sciaice 263:671-673; and PCT Publication No. 
WO 94/18318. 

30 hi another embodiment of the present invention, the screening may be performed by 

adding the labeled ligand (e.g., HSP) to in vitro translation systems such as a rabbit 
reticulocyte lysate (RRL) system and then proceeding with in vitro priming reaction. In vitro 
translation-based libraries include but are not limited to those described in PCT Publication 
No. WO 91/05058 dated April 18, 1991; and Mattheakis et al, 1994, Proc. Natl. Acad. Sci. 

35 USA 91:9022-9026. 

Compounds that can be tested and identified methods described herdn can include 
but are not limited to, compounds obtained from any commercial source, including Aldrich 
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(Milwaukee, WI 53233), Sigma Chemical (St. Louis, MO), Fluka Chemie AG (Buchs, 
Switzerland) Fluka Chemical Corp. (Ronkonkoma, NY;), Eastman Chemical Company, Fine 
Chemicals (Kingsport, TN)> Boehringer Mannheim GmbH (Mannheim, Gemiany), Takasago 
(Rockleigh, NJ), SST Corporation (Clifton, NJ), Ferro (Zachary, LA 70791), RiedeWeHaen 

5 Aktiengesellschaft (Seelze, Germany), PPG Industries Inc., Fine Chemicals (Pittsburgh, PA 
15272). Further any kind of natural products may be screened using the methods of the 
invention, including microbial, fimgal, plant or animal extracts. 

Furthermore, diversity libraries of test compounds, including small molecule test 
compoimds, may be utilized. For example, libraries may be conmiercially obtained from 

10 Specs and BioSpecs B.V. (Rijswijk, The Netherlands), Chembridge Corporation (San Diego, 
CA), Contract Service Company (Dolgoprudny, Moscow Region, Russia), Comgenex USA 
Inc. (Princeton, NJ), Maybridge Chemicals Ltd; (Cornwall PL34 OHW, United Kingdom), 
and Asinex (Moscow, Russia). 

Still further, combinatorial library methods known in the art, can be utilize, including, 

15 but not limited to: biological libraries; spatially addressable parallel solid phase or solution 
phase libraries; synthetic library methods requiring deconvolution; the "one-bead 
one-compound" library method; and synthetic library methods using aflBnity chromatography 
selectioa The biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of 

20 compounds (Lam,1997, Anticancer Drug Des.l2:145). Combinatorial libraries of test 
compounds, including small molecule test compounds, can be utilized, and may, for 
example, be generated as disclosed in Eichler & Houghten, 1995, Mol. Med. Today 1:174- 
180; DoUe, 1997, Mol. Divers. 2:223-236; and Lam, 1997, Anticancer Drug Des. 12:145- 
167. 

25 Examples of methods for the synthesis of molecular libraries can be found in the art, 

for example in: DeWitt et al, 1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb et al, 1994, 
Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al, 1994, J. Med. Chem. 37:2678; Cho 
etal, 1993, Science 261:1303; Carrell etal, 1994, Angew, Chem. Int. Ed. Engl. 33:2059; 
Carell et aL, 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et al, 1994, J. Med. 

30 Chem. 37:1233. 

Libraries of compounds may be presented in solution {e.g., Houghten, 1992, 
BioTechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 
1993, Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (Patent Nos. 
5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al, 1992, Proc. Natl. Acad. Sci. 

35 USA 89:1865-1869) or phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 1990, 
Science 249:404-406; Cwirla et al, 1990, Proc. Natl. Acad. Sci. USA 87:6378-6382; and 
Felici, 1991, J. Mol. Biol. 222:301-310). 
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Screening the libraries can be accomplished by any of a variety of commonly known 
methods. See, eg-., the following references, which disclose screening of peptide libraries: 
Parmley & Smith, 1989, Adv. Exp. Med. Biol. 251:215-218; Scott & Smith, 1990, 
Science 249:386-390; Fowlkes et al, 1992; BioTechniques 13:422-427; Oldenburg et al, 

5 1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu et al, 1994, Cell 76:933-945; Standt et 
ai, 1988, Science 241:577-580; Bock et al, 1992, Nature 355:564-566; Tuerk et al, 1992, 
Proc. Natl! Acad. Sci. USA 89:6988-6992; Ellington et al, 1992, Nature 355:850-852; U.S. 
Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent No. 5,198,346^ all to 
- Ladner et al\ Rebar & Pabo, 1993, Science 263:671-673; and PCT Publication No. 

10 WO 94/18318. 



5.3 IDENTIFICATION OF FRAGMENTS OF THE a2M RECEPTOR AND/OR a2M 
RECEPTOR LIGANDS, SUCH AS HSPS, USEFUL FOR IMMUNOTHERAPY 

The invention also encompasses methods for identifying Ugand-binding a2MR 
fragments (such as 'THSP-binding domains"), and analogs, muteins, or derivatives thereof, 
which are capable of binding to, and uptake of, a2MR ligand-antigenic peptide, such as HSP- 
antigenic peptide complexes. Such Ugand-binding a2MR fragment', e,g,, HSP-binding 
domains, can then be tested for activity in vivo and in vitro using the a2M receptor/ligand 
binding assays, described in Section 5.2.1, above. In one embodiment, such a method for 
identifying an a2MR fragment capable of binding a heat shock protein comprises the steps 
of: (a) contacting a heat shock protein with one or more a2MR fragments; and (b) 
identifying an a2MR polypeptide fragment which specifically binds to the heat shock 
protein. 

Ligand-binding domams, e.^., HSP-binding domains, of the a2MR capable of 
binding ligand-antigenic peptide complexes, such as HSP-antigenic peptide complexes, and 
can be further tested for activity using either in vivo binding assays, re-presentation assays, or 
CTL assays, such as those described in Section 5.2.2, above. For example, one such method 
for identifying an a2MR fragment capable of inducing an HSP-a2M receptor-mediated 
process comprises the steps of: (a) contacting a heat shock protein mfh cell e3q)ressing 
a2MR fragment; and (b) measiuing the level of a2MR activity in the cell, such that if the 
level of the HSP-a2M receptor-mediated process or activity measured in (b) is greater than 
the level of a2MR activity in the absence of the a2MR fragment, then an a2MR fragment 
enable of inducing an HSP-a2M receptor-mediated process is identified. Depending on 
their behavior in such assays, such molecules can be used to either enhance or, alternatively, 
block the frmction of the receptor when administered or expressed in vivo. For example, 
these assays can be used to identify a2MR HSP-binding domains which can bind HSP- 
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antigen complexes and negatively interfere with their uptake by antigen presenting cells. 
These antagonists could be used to downregulate immune responses which are caused by 
cellular release of HSPs. Alternatively, certain a2MR HSP-binding domains may be used to 
enhance HSP-antigen complex uptake and signaling. Such agonists could be administered or 

5 expressed in subjects to elicit an immune response against an antigen of interest. 

In another embodiment, the invention encompasses methods for identifying Ugand 
fragment, such as HSP fragments, which are capable of binding and being taken up by the 
a2M receptor ("a2M receptor-binding domains"), and analogs, muteins, or derivatives 
thereof As described for assays for a2M receptor-related polypeptides described above, 

jQ such a2M receptor-binding domains can then be tested for activity in vivo and in vitro using 
the binding assays described in Section 5.2.1, above. For example, one such method for 
identifying a heat shock protein fragment capable of binding an a2M receptor comprises: (a) 
contacting an a2M receptor with one or more heat shock protein fragments; and (b) 
identifying a heat shock protein fragment which specifically binds to the a2M receptor. 

25 Ligand fragments, such as HSP fragments, of interest may be further tested in cells, 

using in vivo binding assays, re-presentation assays, or CTL assays, such as those described 
m Section 5.2.2, above. For example, in one embodiment, such a method for identifying a 
heat shock protein fragment capable of inducing an HSP-a2M receptor-mediated process 
comprises: a) contacthag an a2M receptor fragment with a cell e^ressing a heat shock 

20 protein; and b) measuring the level of a2MR activity in the cell, such that if the level of the 
HSP-a2M receptor-mediated process or activity measured in (b) is greater than the level of 
a2MR activity in the absence of said heat shock protein fragment. Alternatively, a2M 
receptor-binding domains which decrease uptake of HSPs could be used to block HSP uptake 
by the a2M receptor. In one embodiment, such HSP fragments comprising aOM receptor- 

25 binding domain sequences could be used to construct recombinant fusion proteins, comprised 
of a heat shock protein a2M receptor-binding domain and an antigenic peptide sequence. 
Such recombinant fusion proteins may be used to elicit an immune response and to treat or 
prevent immune diseases and disorders (Suzue et aL, 1997, Proc. Natl. Acad. Sci. U.S A. 



30 The a2M receptor fragments, analogs, muteins, and derivatives and/or ligand (e.g., 

HSP) fragments, analogs, muteins, and derivatives of the invention may be produced by 
recombinant DNA techniques, synthetic methods, or by enzymatic or chemical cleavage of 
native a2M receptor and/or ligands (e.g., HSPs). 

Any eukaryotic cell may serve as the nucleic acid source for obtaining the coding 

35 region of an a2M receptor or a2M receptor Ugand (e.g, , HSP) gene. Nucleic acid sequences 
encoding ligand, e.g., HSPs, and or the a2M receptor can be isolated from vertebrate, 
mammalian, as well as primate sources, includmg humans. Amino acid sequences and 
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nucleotide sequences of naturally occxirring ligands, e.g., HSPs, and a2M receptor are 
generally available in sequence databases, such as Genbank. 

The DNA may be obtained by standard procedures known in the art by DNA 
amplification or molecular cloning directly firom a tissue, cell culture, or cloned DNA (e.g.; a 

5 DNA "library"). Clones derived from genomic DNA may contain regulatory and intron 
DNA regions in addition to coding regions; clones derived from cDNA will contain only 
exon sequences. In a preferred embodiment, DNA can be amplified from genomic or cDNA 
by polymerase chain reaction (PGR) amplification using primers designed from the known 
sequence of an a2M receptor ligand, eg., HSP, a2M, or other a2MR ligand. The polymerase 

IQ chain reaction (PGR) is commonly used for obtaining genes or gene fragments of interest. 
For example, a nucleotide sequence encoding a fragment of any desired length can be 
generated using PGR primers that flank the nucleotide sequence encoding the peptide- 
bindmg domain. Alternatively, an a2MR Mgand, e.g., HSP, a2M, or other a2MR ligand 
receptor gene sequence can be cleaved at appropriate sites with restriction endonuclease(s) if 

J 5 such sites are available, releasing a fragment of DNA encoding the peptide-binding domain. 
If convenient restriction sites are not available, they may be created in the appropriate 
positions by site-directed mutagenesis and/or DNA amplification methods known in the art 
(see, for example, Shankarappa et al, 1992, PGR Method Appl. 1:277-278). The DNA 
fragment that encodes a firigment of the ligand (e.g., HSP) or a2M receptor gene is then 

20 isolated, and ligated into an appropriate expression vector, care being taken to ensure that the 
proper translation reading frame is maintained. Alternatives to isolating the genomic DNA 
include, but are not limited to, chemically synthesizing the gene sequence itself from a 
known sequence or makmg cDNA to the mRNA which encodes the ligand {e.g., HSP) and/or 
a2M receptor. 

25 Any technique for mutagenesis known in the art can be used to modify individual 

nucleotides in a DNA sequrace, for purpose of making amino acid substitution(s) in the 
expressed peptide sequence, or for creating/deleting restriction sites to facihtate ftirther 
manipulations. Such techniques include but are not limited to, chemical mutagenesis, in 
vitro site-dkected mutagenesis (Hutchinson, G., et al, 1978, J. Biol. Ghem 253:6551), 

30 oligonucleotide-directed mutagenesis (Smith, 1985, Ann. Rev. Genet. 19:423-463; Hill et al, 
1987, Methods Enzymol. 155:558-568), PCR-based overlap extension (Ho et al, 1989, Gene 
77:51-59), PCR-based megaprimer mutagenesis (Sarkar et al., 1990, Biotechniques, 8:404- 
407), etc. Modifications can be confirmed by double stranded dideoxy DNA sequencing. 



35 recombinant techniques is peptide synthesis. For example, a peptide corresponding to a 
portion of an a2M receptor and/or ligand (e.g., HSP) comprising tiie substrate-binding 
domain, or which binds peptides in vitro, can be synthesized by use of a peptide synthesizer. 



An altemative to producing a2M receptor and/or Ugand (eg., HSP) firagments by 
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Conventional peptide synthesis may be used or other synthetic protocols well known in the 
art. 

In addition, analogs and derivatives of o2M receptor and/or ligand (e.g.^ HSP) can be 
chemically synthesized. Furthermore, if desired, nonclassical amino acids or chemical 

5 amino acid analogs can be introduced as a substitution or addition into the a2M receptor 
and/or ligand (e.g., HSP) sequence. Non-classical amino acids include but are not limited to 
the D-isomers of the common amino acids, a-amino isobutyric acid, 4-aminobutyric acid, 
Abu, 2-amino butyric acid, y-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric 
acid, 3 -amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, 

jQ citrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, p- 
alanine, fluoro-amino acids, designer amino acids such as p-methyl amino acids, Ca-methyl 
amino acids, Na-methyl amino acids, and amino acid analogs in general. 

a2M recq)tor and/or ligand (eg,, HSP) peptides, or a mutant or derivative thereof 
may be synthesized by solid-phase peptide synthesis using procedures similar to those 

j5 described by Merrifield, 1963, J. Am. Chem. Soc, 85:2149. During synthesis, N-a-protected 
amino acids having protected side chams are added stq>wise to a growing polypeptide chain 
linked by its C-tenninal and to an insoluble polymeric support i.e., polystyrene beads. The 
peptides are synthesized by linking an amino group of an N-a-deprotected amino acid to an 
a-carboxyl group of an N-a-protected amino acid that has been activated by reacting it witii a 

20 reagent such as dicyclohexylcarbodiimide. The attachment of a free amino group to the 
activated carboxyl leads to peptide bond formation. The most commonly used N-a- 
protecting groups include Boc which is acid labile and Fmoc which is base labile. Details of 
appropriate chemistries, resins, protecting groups, protected amino acids and reagents are 
well known in the art and so are not discussed in detail herein (See, Atherton, et al, 1989, 

25 Solid Phase Peptide Synthesis: A Practical Approach, IRL Press, and Bodanszky, 1993, 
Peptide Chemistry, A Practical Textbook, 2nd Ed., Springer-Verlag). 

Purification of the resulting fragment is accomplished using conventional procedures, 
such as preparative HPLC using gel permeation, partition and/or ion exchange 
chromatography. The choice of appropriate matrices and buffers are well known in the art 

30 and so are not described in detail herein. 

In an altemative embodiment, fragments of an cx2M receptor and/or ligand (e.g'., 
HSP) may be obtained by chemical or enzymatic cleavage of native or recombinant a2M 
receptor and^or ligand (e.g., HSP) molecules. Specific chemical cleavage can be performed 
by cyanogen bromide, NaBH4, acetylation, formylation, oxidation, reduction, metabolic 

33 syntiaesis in the presence of tunicamycin, etc, . Endoproteases that cleave at specific sites 
can also be used. Such proteases are known in the art, including, but not limited to, trypsin, 
a-chymotrypsin, V8 protease, papain, and proteinase K (see Ausubel et al, (eds.), in 
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"Current Protocols in Molecular Biology**, Greene Publishing Associates and Wiley 
Interscience, New York, 17.4.6-17.4.8). The a2M receptor and/or ligand HSP) amino 
acid sequence of interest can be exanoined for the recognition sites of these proteases. An 
enzyme is chosen which can release a peptide-binding domain or peptide-binding fragment. 

5 The a2M receptor and/or Ugand HSP) molecule is then incubated with the protease, 
under conditions that allow digestion by the protease and release of the specifically 
designated peptide-binding fragments. Alternatively, such protease digestions can be carried 
out blindly, z.e., not knowing which digestion product will contain the peptide-binding 
domain, using specific or general specificity proteases, such as proteinase K or pronase. 

IQ Once a fi:agment is prepared, the digestion products may be pxirified as described 

above, and subsequently tested for the abiUty to bind peptide or for immunogenicity. 
Methods for detenrdning the immunogenicity of a2M receptor ligand (e.g., HSP) complexes 
by cytotoxicity tests are described in Section 5.2.2. 

15 5.4 DRUG DESIGN 

Upon identification of a compound that interacts with a2MR, or modulates the 
interaction of an a2M receptor ligand, such as an HSP, with the a2M receptor, such a 
compound can be fiirther investigated to test for an ability to alter the immune response. In 
particular, for example, the compounds identified via the present methods can be fiuther 
20 tested in vivo in accepted animal models of HSP-a2MR-mediated processes and HSP- a2MR 
related disorders, such as, eg., immune disorders, prohferative disorders, and infectious 
diseases. 

Computer modeling and searching technologies pemiit identification of compounds, 
or the improvement of ahready identified compounds, which can modulate the interaction of 

25 the a2M recq)tor with its ligand, e.g., an HSR Having identified such a compound or 
composition, the active sites or regions are identified. Such active sites might typically be 
ligand binding sites. The active site can be identified using methods known in the art 
including, for example, fix>m the anouno acid sequences of peptides, from the nucleotide 
sequences of nucleic acids, or from study of complexes of the relevant compound or 

30 composition with its natural ligand. In the latter case, chraiical or X-ray ciystallographic 
methods can be used to find the active site by finding where on the factor the complexed 
ligand is found. 

Next, the three dimensional geometric structure of the active site is determined. This 
can be done by known methods, including X-ray crystallography, which can determine a 
35 complete molecular structure. On the other hand, solid or Uquid phase NMR can be used to 
determine certain intra-molecular distances. Any other experimental method of structure 
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determination can be used to obtain partial or complete geometric structures. The geometric 
structures may be measured with a complexed ligand, natural or artificial, which may 
increase the accuracy of the active site structure determined. 

If an incomplete or insufficiently accurate structure is determined, the methods of 

5 computer based numerical modeling can be used to complete the structure or improve its 
accuracy. Any recognized modeling method may be used, including parameterized models 
specific to particular biopolymers such as proteins or nucleic acids, molecular dynamics 
models based on computing molecular motions, statistical mechanics models based on 
. thermal ensembles, or combined models. For most types of models, standard molecular 

IQ force fields, representing the forces between constituent atoms and groups, are necessary, and 
can be selected fi-om force fields known in physical chemistry. The incomplete or less 
accurate experimental structures can serve as constraints on the complete and more accurate 
structures computed by these modeling methods. 

Finally, having determined the structure of the active site, either experimentally, by 

1 5 modeling, or by a combination, candidate modulating compounds can be identified by 
searching databases containing compounds along with information on their molecular 
structure. Such a search seeks compounds having structures that match the determined active 
site structure and that interact with the groups defining the active site. Such a search can be 
manual, but is preferably computer assisted. These compounds found firom this search are 

20 po^^^tial the a2M receptor-modulating compounds. 

Alternatively, these methods can be used to identify improved modulating 
compounds from an already known modulating compoimd or ligand. The composition of the 
known compound can be modified and the structural effects of modification can be 
• determined using the experimental and computer modeling methods described above applied 

25 to the new composition. The altered structure is then compared to the active site structure of 
the compound to determine if an improved fit or interaction results. In this manner 
systematic variations in composition, such as by varying side groups, can be quickly 
evaluated to obtain modified modulating compounds or ligands of improved specificity or 
activity. 

30 Further experimental and computer modeling methods usefiil to identify modulating 

compounds based upon identification of the active sites of either the a2M receptor or the 
HSP, and other a2M receptor ligands and their analogs, will be apparent to those of skill in 
the art. 

Examples of molecular modelmg systems are the CHARMm and QUANTA 
35 programs (Polygen Corporation, Waltham, MA). CHARMm performs the energy 
nodnimization and molecular dynamics fimctions. QUANTA performs the construction, 
graphic modellmg and analysis of molecular structure. QUANTA allows interactive 
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construction, modification, visualization, and analysis of the behavior of molecules with each 
other. 

A number of articles review computer modeling of drugs interactive with specific 
proteins, such as Rotivinen et al) 1988, Acta Pharmaceutical Fennica 97:159-166); Ripka 

5 (1988 New Scientist 54-57); McKinaly and Rossmann (1989, Annu. Rev. Pharmacol. 
Toxiciol. 29:1 1 1-122); Perry and Davies, OSAR: Quantitative Structure-Activity 
Relationships in Drug Design pp. 189-193 Alan R. Liss, Inc. 1989; Lewis and Dean (1989, 
Proc. R. Soc. Lond. 236:125-140 and 141-162); and, with respect to a model receptor for 
nucleic acid components. Askew et al (1989, J. Am. Chem. Soc. 111:1082-1090). Other 

JO computer programs that screen and graphically depict chemicals are available fix)m 
companies such as BioDesign, Inc. (Pasadena, CA.), AUelix, Inc. ^ssissauga, Ontario, 
Canada), and Hypercube, Inc. (Cambridge, Ontario). Although these are primarily designed 
for application to drugs specific to particular proteins, they can be adapted to design of drugs 
specific to regions of DNA or RNA, once that region is identified. 

15 

5.5 DIAGNOSTIC USES 

The a2M receptor is a cell surface protein present on many tissues and cell types 
(Herz et al, 1988, EMBO J. 7:41 19-27; Moestrup et al, 1992, Cell Tissue Res. 269: 375- 
82), that s^pears to be involved in the specific uptake and re-presentation of a2M receptor 

20 ligands, such as HSPs and HSP- peptide complexes. The a2M receptor was initially 
identified as a heat shock protein receptor due to its interaction with gp96, which is 
exclusively intracellular and is released as a result of necrotic but not apoptotic cell death. 
Thus, gp96 uptake by the a2M receptor may act as a sensor of necrotic cell death. As such, 
a2M receptor-ligand complexes may be used to detect and diagnose proliferative (fisorders, 

25 such as cancer, autoimmune disorders and infectious disease. Therefore, a2M receptor 
proteins, analogues, derivatives, and subsequences thereof, a2M receptor nucleic acids (and 
sequences complementary tiiereto), and anti-a2M receptor antibodies, have uses in detecting 
and diagnosing such disorders. 

The a2M receptor and a2M receptor nucleic acids can be used in assays to detect, 

30 prognose, or diagnose immune system disorders that may result in tumorigenesis, 
carcinomas, adenomas etc, and viral disease. 

The molecules of the present invention can be used in assays, such as immunoassays, 
to detect, prognose, diagnose, or monitor various conditions, diseases, and disorders affecting 
a2M receptor expression, or monitor the treatment thereof In particular, such an 

35 immunoassay is carried out by a method comprising contacting a sample derived from a 
patient with an HSP-a2M receptor specific antibody under conditions such that 
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immunospeciJBc binding can occur, and detecting or measuring the amount of any 
inununospecific binding by the antibody. In a specific aspect, such binding of antibody, in 
tissue sections, can be used to detect aberrant a2M receptor localization or aberrant (e.g., low 
or absent) levels of a2M receptor. In a specific embodiment, antibody to the a2M receptor 

5 can be used to assay a patient tissue or serum sample for the presence of the a2M receptor 
whore an aberrant level of a2M receptor is an indication of a diseased condition. By 
"aberrant levels," is meant increased or decreased levels relative to that present, or a standard 
level representing that present, in an analogous sample firom a portion of the body or firom a 
subject not having the disorder. 

IQ ' The immunoassays which can be used include but are not limited to competitive and 
non-competitive assay ^sterns using techniques such as western blots, immunohisto- 
chemistiy radioimmimoassays, ELISA, "sandwich" inmiunoassays, immunoprecipitation 
assays, precipitin reactions, gel difiusion precipitin reactions, immunodiffusion assays, 
agglutination assays, con^lement^fixation assays, immunoradiometric assays, fluorescent 

2^ immimoassays, protein A immunoassays, to name but a few. 

a2M receptor genes and related nucleic acid sequences and subsequences, including 
complementary sequences, can also be used m hybridization assays. a2M receptor nucleic 
acid sequences, or subsequences thereof, comprising about at least 8 nucleotides, can be used 
as hybridization probes. Hybridization assays can be used to detect, prognose, diagnose, or 

20 monitor conditions, disorders, or disease states associated with aberrant changes in a2M 
receptor expression and/or activity as described supra. In particular, such a hybridization 
assay is carried out by a method comprising contacting a sample containing nucleic acid with 
a nucleic acid probe capable of hybridizing to a2M receptor DNA or RNA, under conditions 
such that hybridization can occur, and detecting or measuring any resulting hybridization. 

25 In specific embodiments, diseases and disorders involving decreased immune 

responsiveness during an infection or malignant disorder can be diagnosed, or their suspected 
presence can be screened for, or a predisposition to develop such disorders can be detected, 
by detecting decreased levels of a2M receptor protein, a2M receptor RNA, or the a2M 
receptor functional activity (e.g., binding to HSP, antibody-binding activity e/c), or by 

30 detecting mutations in a2M receptor RNA, DNA or a2M receptor protein , 

translocations in the a2M receptor nucleic acids, truncations in the a2M receptor gene or 
protein, changes in nucleotide or amino acid sequence relative to wild-type a2M receptor) 
that cause decreased expression or activity of a2M recqptor. Such diseases and disorders 
include but are not limited to those described in Sections 5.7, 5.8, and 5.9. By way of 

35 example, levels of the a2M receptor protein can be detected by immunoassay, levels of a2M 
receptor RNA can be detected by hybridization assays (e.g.. Northern blots, in situ- 
hybridization), a2M receptor activity can be assayed by measuring binding activities in vivo 
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or in vitro. Translocations, deletions, and point mutations in a2M receptor nucleic acids can 
be detected by Southern blotting, FISH, RFLP analysis, SSCP, PGR using primers, 
preferably primers that generate a fragment spanning at least most of the a2M receptor gene, 
sequencing of a2M receptor genomic DNA or cDNA obtained from the patient, etc. 

5 In a preferred embodiment, levels of a2M receptor mRNA or protein in a patient 

sample are detected or measured relative to the levels present in an analogous sample from a 
subject not having the malignancy or hyperproliferative disorder. Decreased levels indicate 
that the subject may develop, or have a predisposition to developing, viral infection, 
malignancy, or hyperproliferative disorder. 

In anoth^ specific raibodiment, diseases and disorders involving a deficient immune 
responsiveness resulting in cell proliferation or in which cell proliferation is desirable for 
treatment, are diagnosed, or their suspected presence can be screened for, or a predisposition 
to develop such disorders can be detected, by detecting mcreased levels of the a2M receptor 
protein, a2M receptor RNA, or the a2M receptor fimctional activity HSP binding or 

J 5 a2M receptor antibody, etc.), or by detecting mutations in a2M receptor RNA, DNA or 
protem translocations in a2M receptor nucleic acids, truncations in the gene or protein, 
changes in nucleotide or amino acid sequence relative to wild-type a2M receptor) that cause 
increased expression or activity of the a2M receptor. Such diseases and disorders include, 
but are not limited to, those described in Sections 5.7, 5.8, and 5.9. By way of example, 

20 levels of the a2M receptor protein, levels of a2M receptor RNA, a2M receptor binding 

activity, and the presence of translocations or point mutations can be determined as described 
above. 

In a specific embodiment, levels of a2M receptor mRNA or protein in a patient 
sample are detected or measiu:ed, relative to fiie levels present in an analogous sample from a 

25 subject not having the disorder, in which increased levels indicate that the subject has, or has 
a predisposition to, an autoimmune disorder. 

Kits for diagnostic use are also provided, diat comprise in one or more containers an 
anti-a2M receptor antibody, and, optionally, a labeled binding partner to the antibody. 
Alternatively, the anti-a2M receptor antibody can be labeled (with a detectable marker, eg:, 

30 a chemiluminescent, enzymatic, fluorescent, or radioactive moiety). A kit is also provided 
that comprises in one or more containers a nucleic acid probe cs^able of hybridizing to a2M 
receptor RNA. In a specific embodiment, a kit can comprise in one or more containers a pair 
of primers (e.g., each in the size range of 6-30 nucleotides) that are capable of priming 
amplification [eg., by polymerase chain reaction (see e.g., Innis et al, 1990, PGR Protocols, 

35 Academic Press, Inc., San Diego, CA), ligase chain reaction (see BP 320,308) use of QP 
replicase, cyclic probe reaction, or other methods known in the art] under appropriate 
reaction conditions of at least a portion of an a2M receptor nucleic acid. A kit can optionally 
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further comprise in a container a predetermined amount of a purified a2M receptor protein or 
nucleic acid, e.g.^ for use as a standard or control. 

5.6 THERAPEUTIC USES 

^ The invention further encompasses methods for modulating the immime response. 

The a2M receptor recognizes and transports antigenic peptide complexes (e.g,, HSP- 
antigenic peptide complexes) for the purpose of presenting such antigenic molecules to cells 
of the immime system and eliciting an immune response. Thus, the compositions and 
methods of the invention may be used for therapeutic treatment of HSP-a2M receptor-related 

10 disorders and conditions, such as autoimmune diseases, cancer and infectious diseases. In 
particular, as described in detail hereinbelow, recombinant cells comprising a2M receptor 
complexes, such as HSP-antigenic peptide complexes, antibodies and other compounds that 
interact with the a2M receptor, or modulate the interaction between the a2M receptor and its 
ligands, e.g., HSP, as well as other compounds that modulate HSP-a2M receptor-mediated 

15 processes may be used to elicit, or block, an immune response to treat such HSP- a2M 
receptor-related disorders and conditions. 

5.6.1 THERAPEUTIC USB OF mENTIFffiDAGOraSTS AND ANTAGONISTS 

Compounds, such as those identified by screening methods provided herein, that 
interact with the a2M receptor (herein "a2MR*0, or modulate the interaction between the 
a2M receptor and its ligand, HSP, can be usefiil as tiierapeutics. Such compounds, 
include, but are not lunited to, agonists, antagonists, such as antibodies, antisense RNAs and 
ribozymes Compounds which interfere with ligand {e.g. , HSP) -a2M receptor interaction 
can be used to block an immune response, and can be used to treat autoimmune responses 
and conditions. Other antibodies, agonists, antagonists, antisense RNAs and ribozymes may 
upregulate ligand (e.g.y HSP)-a2MR interaction, activity, or expression, and would enhance 
the uptake of antigen complexes (e.g., HSP-antigen complexes), and therefore be useful in 
stimulating the host's immime system prior to, or concurrent with, the administration of a 
vaccine. Described below are methods and compositions for the use of such compounds in 
the treatment of HSP-a2M receptor-related disorders, such as immune disorders, proliferative 
disorders, and infectioxis diseases. 

In one embodiment an antagonist of a2M receptor-ligand (e.g, HSP-a2M receptor) 
interaction is used to block the immune response. Such antagonists include compounds that 
interfere with binding of a ligand (e.g, , an HSP) to the receptor by competing for binding to 
the a2M receptor, the ligand, or the ligand-a2M receptor complex. 
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In one embodiment, the antagonist is an antibody specific for the a2M receptor, or a 
fi:agment thereof which contains the HSP ligand binding site. In another embodiment the 
antagonist is an antibody specific for an HSP, which interferes with binding of the HSP to 
the receptor. 

5 In another embodiment, the antagonist is an peptide which comprises at least 

contiguous 10 amino acids of an HSP sequence. Such a peptide can bind to the Ugand 
binding site of the a2M receptor a block the interaction of an HSP or HSP complex. In 
another embodiment, the antagonist is a peptide which comprises at least contiguous 10 
amino acids of a2M sequence, which, like an HSP, can bind to the a2M receptor and 

IQ interfere with the bindmg and uptake of HSP-antigen complexes. In yet another embodiment, 
the antagonist is a peptide which comprises at least contiguous 10 amino acids of a2M 
receptor sequence, in particular the ECD of the a2M receptor (or a portion thereof), which 
can bind to and "neutralize" natural ligands, such as HSPs, a2M, LDL, etc. 

Such peptides may be produced syn&etically or by using standard molecular biology 

25 techniques. Amino acid sequences and nucleotide sequences of naturally occurring a2M 
receptor ligands, such as a2M and HSPs, are generally available in sequence databases, such 
as GenBank. Computer programs, such as Entrez, can be used to browse the database, and 
retrieve any amino acid sequence and genetic sequence data of interest by accession number. 
Methods for recombinant and synthetic production of such peptides are described in Sections 

20 5.1.1 and 5.1.2. 

Additionally, compounds, such as those identified via techniques such as those 
described hereinabove, in Section 5.2, that are capable of modulating a2M receptor gene 
product activity can be administered using standard techniques that are well known to those 
ofskillintheart. 

25 

5.6.1.1 COMPETITIVE ANTAGONISTS OF a2MR-LIGAND 
DSTTERACnONS 

In one embodiment an antagonist of an a2Mr-ligand (e.g., HSP- a2M receptor) 
interaction is used to block the immune response to an antigen complex, e,g., to treat an auto- 

30 immune disorder. Such antagotusts include molecules that interfere with bindmg by binding 
to the a2M recq)tor, thereby interfering with binding of a ligand (^.g., HSP) to the receptor. 
An example of this type of competitive inhibitor is an antibody to a2M receptor, or a 
fragment of a2MR which contains an HSP ligand binding site. Another example of a 
competitive antagonist is a2M, or a receptor-bmding fragment thereof, which itself binds to 

35 a2MR, thereby blocking the binding and uptake of HSP-antigen complexes by the cell. 

An a2MR-ligand (eg., HSP) competitive inhibitor can be any type of molecule, 
including but not limited to a protein, nucleic acid or drug. In a preferred embodiment, an 
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HSP-a2M competitive inhibitor is an a2MR-binding or an HSP-binding peptide. Examples 
of such peptides are provided below. 



In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 
an a macroglobulin, preferably a2M, or a2MR-binding portion ttiereof. 

Functional expression of a2M or a2MR-binding portions thereof (including 
recombinant expression as a FX fusion protein, processing, purification and refolding) is 

10 preferably carried out as described by Holtet et al, 1994, FEES Lett. 344:242-246. 

In a specific mode of the embodiment, an a2MR-binding portion of a2M consists of 
or comprises a firagmoit of the a2M RED consisting of at least 10 (continuous) amino acids. 
In other modes of the embodiment, the fi:agment consists of at least 20, 30, 40, SO, 75 or 100 
amino acids of the RED. In specific modes of the embodiment, such firagments are not larger 

1 5 than 27, 1 38 or 1 S3 amino acids. Most preferred peptides comprise one or both of amino 
acids LySi37o and LySi374. Such peptides include those consisting of amino acids 1299-14S1 
(vRBD in FIG. 13B) (SEQ ID N0:8), 1314-1451 (SEQ ID NO:9) (RED in FIG. 13E) or 
1366-1392 (SEQ ID NO:10) of the mature a2M protein. Other preferred peptides include 
but are not limited to those consisting of amino acids 1300-1425 (SEQ IDN0:11), 1300- 

20 1400 (SEQ ID N0:12), 1300-1380 (SEQ ID N0:13), 1325-1425 (SEQ ID N0:14), 1325- 
1400 (SEQIDN0:15), 1325-1380 (SEQIDN0:16), 1350-1425 (SEQ IDN0:17), 1350- 
1400 (SEQ ID N0:18), or 1350-1380 (SEQ ID N0:19) of the mature human a2M protein. 

Derivatives or analogs of a2M or a2MR-binding portions of a2M are also 
contemplated as competitive antagonists of HSP-a2MR complexes. Such derivative or 

25 analogs include but are not limited to tibiose molecules comprising regions that are 
substantially homologous to a2M, Ihe a2M RED or fragments thereof (e.g., in various 
embodiments, at least 60% or 70% or 80% or 90% or 95% identity over an amino acid 
sequrace of identical size or when compared to an aligned sequence in which the alignment 
is done by a computer homology program known in the art) or whose encoding nucleic acid 

30 is capable of hybridizing to a coding a2M RBD sequence, under stringent, moderately 
stringent, or nonstringent conditions. In certain specific embodiments, an a2M derivative is 
a chimeric or fusion protein comprising an a2M protein or a2MR-binding portion thereof 
(preferably consisting of at least 10 amino acids of the a2M RBD comprising Lys,37o and 
Lysi374) joined at its amino- or carboxy-temiinus via a peptide bond to an amino acid 

35 sequence of a different protein. 

In particular, a2M derivatives can be made by altering a2M coding sequences by 
substitutions, additions or deletions that provide for fimctionally equivalent molecules. Due 
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to the degeneracy of nucleotide coding sequences, other DNA sequences which encode 
substantially the same amino acid sequence as a a2M gene may be used in the practice of the 
present invention. These include but are not limited to nucleotide sequences comprising all 
or a2MR-binding portions of a2M genes which are altered by the substitution of different 

5 codons that encode a functionally equivalent amino acid residue within the sequence, thus 
producing a silent change. Likewise, the a2M derivatives of the invention include, but are 
not limited to, those containing, as a primary amino acid sequence, all or an a2MR-binding 
portion of the amino acid sequence of an a2M protein, including altered sequences in which 
functionally equivalent amino acid residues are substituted for residues within the sequence 

IQ resulting in a silent change. For example, one or more amino acid residues within the 
sequence can be substituted by another amino acid of a similar polarity which acts as a 
functional equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid belongs. 
For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, 

J 5 valine, proline, phenylalanine, tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The 
positively charged (basic) anoino acids mclude arginine, lysine and histidine. The negatively 
charged (acidic) amino acids include aspartic acid and glutamic acid. 

The a2M derivatives and analogs of the invention can be produced by various 

2Q methods known in the art. The manipulations which result in their production can occur at 
the gene or protein level. For example, the cloned a2M gene sequence can be modified by 
any of numerous strategies known in the art (Maniatis, T., 1990, Molecular Cloning, A 
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, Cold Spring Haibor, New 
York). The sequence can be cleaved at appropriate sites with restriction endonuclease(s), 

25 followed by further enzymatic modification if desired, isolated, and ligated in vitro. In the 
production of the gene encoding a derivative or analog of a2M, care should be taken to 
ensure that the modified gene remains within the same translational reading firame as a2M, 
uninterrupted by translational stop signals, in the gene region where the desired a2M activity 
is encoded. 

3Q Manipulations of the a2M sequence may also be made at the protein level. Included 

within the scope of the invention are a2M protein fragments or other derivatives or analogs 
which are differentially modified during or after translation, e.g,, by glycosylation, 
acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, 
proteolytic cleavage, linkage to an antibody molecule or other cellular Ugand, etc. Any of 

35 numerous chemical modifications may be carried out by known techniques, including but not 
lindted to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, 
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V8 protease, NaBH4; acetylation, fonnylation, oxidation, reduction; metabolic synthesis in 
the presence of tunicamycin; etc. 

Inaddition, analogs and derivatives of a2M can be chemically synthesized. For 
example, an cz2MR-bmding portion of a2M can be synthesized by use of a peptide 

5 synthesizer. Furthermore, if desired, nonclassical amino acids or chemical amino acid 

analogs can be introduced as a substitution or addition into the a2M sequence. Non-classical 
amino acids include but are not limited to the D*isomers of the common amino acids, a- 
amino isobutyric acid, 4-aminobutyric acid, Abu, 2-aniino butyric acid, y-Abu, 8-Ahx, 
6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-anuno propionic acid, ornithine, 

JO norleucine, norvaline, hydroxyproline, sarcosine, citniUine, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, p-alanine, fluoro-amino acids, designer 
amino acids such as ^-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, 
and amino acid analogs in general. Furtheimore, the amino acid can be D (dextrorotary) or L 
(levorotary). 

25 hi other specific modes of the embodiment, an HSP-a2MR competitive antagonist is 

another a macroglobulin or a2MR-binding portion thereof, for example an a macroglobulin 
RBD domain selected from Nielsen et al^ supra. Fig* 3, Group A. 

RAP 

20 lu one embodiment of the present invention, an HSP-a2MR competitive antagonist is 

a2MR-associated protein (RAP) (Genbank accession no. A39875) or an a2MR-binding 
portion thereof In a specific mode of the embodiment, an a2MR--binding portion of RAP 
consists of or comprises a firagment of the RAP RBD consisting of at least 10 (continuous) 
amino acids. In other modes of the embodunent, the firagment consists of at least 20, 30, 40, 

25 50, 75 or 100 amino acids of the RBD. In specific modes of the embodiment, such fragments 
are not larger than 28, 50 or 100 amino acids. In other specific modes of the embodiment, an 
a2MR-binding portion of RAP comprises an a2MR-binding portion of domain 1 or 3, e.g. as 
depicted ia Nielsen et aL, supra. Fig. 3, Group D or E. Expression of recombinant RAP or 
an a2MR-binding portion thereof, e.g, domain 1 or 3, is preferably achieved as described by 

30 Andeisea et aL, supra). 

5.6.1.1.2 HSP-BINDING PEPTIDES 

a2MR peptides 

In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 
35 a2MR peptide, preferably a soluble peptide, that can bind to HSPs and therefore 
competitively inhibit HSP binding to the native receptor. 
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Functional expression of HSP-binding portions of a2MR is preferably carried out as 
described for the CR8 domain by Huang et al, 1999, J. Biol. Chem 274:14130-14136. 
Briefly, to maintain proper folding, the protein is expressed as a GST fusion, expressed 
recombinantly, the GST portion cleaved, xmcleaved protein removed on GSH-Sepharose, and 

5 cleaved protein refolded. Since the complement repeats bind to calcium, proper folding is 
assayed by measuring the binding of the refolded protein to calcium. 

In a specific mode of the embodiment, an HSP-bmding portion of a2MR consists of 
or comprises at least one complement repeat, most preferably selected from CR3-CR10. In 
another specific mode of the embodiment, an HSP-binding portion of a2MR comprises a 

IQ cluster ofcomplement repeats, most preferably Cl-n. In other modes of the embodiment, the 
HSP-binding portion consists of at least 10, more preferably at least 20, yet more preferably 
at least 30, yet more preferably at least 40, and most preferably at least 80 (continuous) 
amino acids. In specific modes of the embodiment, such firagments are not larger than 40-45 
amino acids. In other specific modes of the embodiment, such firagments are not larger than 

15 80-90 amino acids. Exemplary preferred peptides include but are not limited to those 

consisting of amino acids 25-68 (SEQ ID NO:20), 25-110 (SEQ ID N0:21), 68-110 (SEQ ID 
NO:22), 853-894 (SEQ ID NO:23), 853-934 (SEQ ID NO:24), 853-974 (SEQ ID NO:25), 
853-1013 (SEQ ID NO:26), 853-1060 (SEQ ID NO:27), 853-1102 (SEQ ID NO:28), 853- 
1183 (SEQ ID NO:29), 895-934 (SEQ ID NO:30), 895-974 (SEQ ID N0:31), 895-1013 

20 (SEQ ID NO:32), 895-1060 (SEQ ID NO:33), 895-1 102 (SEQ ID NO:34), 895-1 183 (SEQ 
ID NO:35), 935-974 (SEQ ID NO:36), 935-1013 (SEQ ID NO:37), 935-1060 (SEQ ID 
NO:38), 935-1102 (SEQ ID NO:39), 935-1183 (SEQ ID NO:40), 975-1013 (SEQ ID 
N0:41), 975-1060 (SEQ ID NO:42), 975-1143 (SEQ ID NO:43), 975-1183 (SEQ ID 
NO:44), 1014-1060 (SEQ ID NO:45), 1014-1102 (SEQ ID NO:46), 1014-1183 (SEQ ID 

25 NO:47). 1061-1 102 (SEQ ID NO:48), 1061-1 143 (SEQ ID NO:49), 1061-1 183 (SEQ ID 
NO:50), 1103-1143 (SEQ ID N0:51), 1103-1183 (SEQ ID NO:52), or 1144-1183 (SEQ ID 
NO:53) of human a2MR. 

Derivatives or analogs of HSP-bmding portions a2MR also contemplated as 
competitive antagonists of HSP-a2MR complexes. Such derivative or analogs include but 

3Q are not limited to those molecules comprising regions that are substantially homologous to 
the extracellular domain of (x2MR or fiagments thereof (&.g., in various embodiments, at 
least 60% or 70% or 80% or 90% or 95% identity over an amino acid sequence of identical 
size or when compared to an aligned sequence in which the alignment is done by a computer 
homology program known in the art) or whose encoding nucleic acid is capable of 

35 hybridizing to a sequence encoding an a2MR HSP-binding sequence, under stringent, 

moderately stringent, or nonstringent conditions. In certain specific embodiments, an a2MR 
derivative is a chimeric or fiision protein comprising an HSP-binding portion of a2MR, 
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preferably consisting of at least one complement repeat of Cl-II) joined at its amino- or 
carboxy-terminus via a peptide bond to an amino acid sequence of a different protein. Such a 
chimeric protein can be produced recombinantly as described above, by omitting the 
cleavage repurification steps. 

5 Other HSP-binding a2MR derivatives can be made by altering a2MR coding 

sequences by substitutions, additions or deletions that provide for functionally equivalent 
molecules. Due to the degeneracy of nucleotide coding sequences, other DNA sequences 
which encode substantially the same amino acid sequence as an HSP-binding a2MR gene or 
gene fragment may be used in the practice of the present invention. Selection of suitable 

IQ alterations and production of HSP-binding a2MR derivatives can be made applying the same 
principles described above for a2M derivatives and using the general methods described in 
Sections 5.1.1 and 5.1.2. 

. HSP peptides 

15 In another mode of the embodiment, the antagonist is an peptide which conq)rises at 

least contiguous 10 amino acids of an HSP sequence. Such a peptide can bind to the ligand 
binding site of the a2M receptor a block the interaction of an HSP or HSP complex. . 

Such peptides may be produced synthetically or by using standard molecular biology 
techniques. Amino acid sequences and nucleotide sequences of naturally occurring HSPs are 

2Q generally available in sequence databases, such as GenBank. Computer programs, such as 
Entrez, can be used to browse the database, and retrieve any amino acid sequence and genetic 
sequence data of interest by accession number. Methods for recombinant and synthetic 
production of such peptides are described m Sections 5.1.1 and 5.1.2. 

Additionally, compounds, such as those identified via techniques such as those 

25 described hereinabove, in Section 5.2, that are capable of modulating a2M receptor gene 
product activity can be administered using standard techniques that are well known to those 
of skill in die art. 

5.6.2 THERAPEUTIC USE OF THE a2M RECEPTOR AGAINST CANCER 
AND INFECTIOUS DISEASES 

30 

In another embodiment, symptoms of certain a2M receptor gene disorders, such as 
autoimmune disorders, or proliferative or diiferentiative disorders causing tumorigenesis or 
cancer, may be ameliorated by modulating the level of a2M receptor gene expression and/or 
a2M receptor gene product activity. In one embodiment, for example, a decrease in a2M 
35 receptor gene expression may be useful to decrease a2M receptor activity, and ameliorate the 
symptoms of an autoimmune disorder. In this case, the level of a2M receptor gene 
expression may be decreased by using a2M receptor gene sequences in conjunction with 
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well-known antisense, gene "knock-out," ribozyme and/or triple helix methods. In another 
embodiment, an increase in a2M receptor gene expression may be desired to compensate for 
a mutant or impaired gene in an HSP-a2M receptor-mediated pathway, and to ameUorate the 
symptoms of an HSP- a2M receptor-related disorder. 

5 Among the compounds that may exhibit the ability to modulate the activity, 

expression or synthesis of the a2M receptor gene, including the ability to ameliorate the 
symptoms of an HSP-a2M receptor related disorder are antisense, ribozyme, and triple helix 
molecules. Such molecules may be designed to reduce or.inhibit either imimpaired, or if 
s^propriate, mutant target gene activity. Techniques for the production and use of such 

IQ molecules are well known to those of skill in the art. 

Antisense RNA and DNA molecules act to directly block the translation of mKNA by 
hybridizing to tai^eted mRNA and preventing protein translation. Antisense ^proaches 
involve the design of oligonucleotides that are complementary to a target gene mRNA. The 
antisense oligonucleotides will bind to the complementary target gene mRNA transcripts and 

J 5 prevent translation. Absolute complementarity, although preferred, is not required. 

A sequence "complementary" to a portion of an RNA, as referred to herein, means a 
sequence having sufficient complementarity to be able to hybridize with the RNA, forming a 
stable duplex; in the case of double-stranded antisense nucleic acids, a single strand of the 
duplex DNA may thus be tested, or triplex formation may be assayed. The ability to 

20 hybridize will depend on both the degree of complementarity and the length of the antisense 
nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base mismatches 
with an RNA it may contain and still form a stable duplex (or triplex, as the case may be). 
One skilled in the art can ascertain a tolerable degree of mismatch by use of standard 
procedures to determine the melting point of the hybridized complex. 

25 hi one embodiment, oligonucleotides complementary to non-coding regions of the 

a2M receptor gene could be used in an antisense s^proach to inhibit translation of 
endogenoxis (x2M receptor mRNA. Antisense nucleic acids should be at least six nucleotides 
in length, and are preferably oligonucleotides ranging from 6 to about SO nucleotides in 
length. In specific aspects the oligonucleotide is at least 10 nucleotides, at least 17 

30 nucleotides, at least 25 nucleotides or at least SO nucleotides. 

In an embodiment of the present invention, oligonucleotides complementary to the 
nucleic acids encoding the HSP receptor ligand bindmg .domain are used. 

Regardless of the choice of target sequence, it is preferred that in vitro studies are 
first performed to quantitate the ability of the antisense oligonucleotide to inhibit gene 

35 expression. It is preferred that these studies utilize controls that distinguish between antisense 
gene inhibition and nonspecific biological effects of oUgonucleotides. It is also preferred that 
these studies compare levels of the target RNA or protein with that of an internal control 
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RNA or protein. Additionally, it is envisioned that results obtained using the antisense 
oligonucleotide are compared with those obtained using a control oligonucleotide. It is 
preferred that the control oligonucleotide is of approximately the same length as the test 
oligonucleotide and that the nucleotide sequence of the oUgonucleotide differs from the 
5 antisense sequence no more than is necessary to prevent specific hybridization to the target 
sequence. 

The oligonucleotides can be DNA or RNA or chimeric mixtures or daivadves or 
modified v^ions th^of, single-stranded or double-stranded. The oligonucleotide can be 
modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve 

IQ stability of the molecule, hybridization, etc. The oligonucleotide may include other 

appended groups such as peptides for targeting host cell receptors in vivo), or agents 
facilitating transport across the cell membrane (see, Letsinger et aL, 1989, Proc. Natl. 
Acad, Sci. U.S.A. 86, 6553-6556; Lemaitre etaL, 1987, Proc. Natl. Acad. Sci. 84, 648-652; 
PCT Publication No. WO88/09810, published December 15, 1988) or the blood-brain barrier 

15 (see, PCT Publication No. WO89/10134, pubUshed April 25, 1988), hybridization- 
triggered cleavage agents (see, Krol et al, 1988, BioTechniques 6, 958-976) or 
intercalating agents (see, e.^., Zon, 1988, Pharm, Res. 5, 539-549). To this end, the 
oligonucleotide may be conjugated to another molecule, a peptide, hybridization 
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc. 

20 The antisense oligonucleotide may comprise at least one modified base moiety which 

is selected firom the group including but not limited to 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 
5-(carboxyhydroxyknethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 

25 N6-isopentenyladenine, 1-methylguanine, 1-methyUnosine, 2,2-dmiethylguanine, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomefhyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 

30 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 
5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar moiety 
selected from the group including but not limited to arabinose, 2-fluoroarabinose, xylulose, 
35 andhexose. 

In yet another embodiment, the antisense oligonucleotide comprises at least one 
modified phosphate backbone selected from the group consisting of a phosphorotfaioate (S- 
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ODNs), aphosphorodithioate, a phosphoramidothioate, aphosphoramidate, a 
phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a fonnacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an a-anomeric 
5 oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual P-units, the strands run parallel to each 
other (Gautier et al, 1987, Nucl Acids Res. 15, 6625-6641). The oligonucleotide is a 2'-0- 
methykibonucleotide (bioue etal^ 1987, Nucl. Acids Res. 15, 6131-6148), or a chimeric 
RNA-DNA analogue (Inoue et al, 1987, FEES Lett. 215, 327-330). 
IQ OUgonucleotides of the invention may be synthesized by standard methods known in 

the art, e.g, by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides 
may be synthesized by the method of Stein et al (1988, Nucl. Acids Res. 16, 3209), 
methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer 
15 supports (Sarin et al, 1988, Proc. Natl. Acad. Sci. U.S.A. 85, 7448-7451), etc. 

While antisense nucleotides complementary to the target gene coding region 
sequence could be used, those complementary to the transcribed, untranslated region are 
most preferred. 

In one embodiment of the present invention, gene expression downregulation is 

20 achieved because specific target mRNAs are digested by RNAse H after they have 

hybridized Avith the antisense phosphorothioate oligonucleotides (S-ODNs). Since no rules 
exist to predict which antisense S-ODNs will be more successfixl, the best strategy is 
completely empirical and consists of trying several antisense S-ODNs. Antisense 
phosphorothioate oligonucleotides (S-ODNs) will be designed to target specific regions of 

25 mRNAs of interest. Control S-ODNs consisting of scrambled sequences of the antisense S- 
ODNs will also be designed to assure identical nucleotide content and minimize differences 
potentially attributable to nucleic acid content. All S-ODNs can be synthesized by Oligos 
Etc. (Wilsonville, OR). In order to test the effectiveness of the antisense molecules when 
^plied to cells in culture, such as assays for research purposes or vivo gene therapy 

3Q protocols, cells will be grown to 60-80% confluence on 100 mm tissue culture plates, rinsed 
with PBS and overlaid with lipofection mix consisting of 8 ml Opti-MEM, 52.8 /il 
Lipofectin, and a final concentration of 200 nM S-ODNs. Lipofections will be carried out 
using Lipofectin Reagent and Opti-MEM (Gibco BRL). Cells will be incubated in the 
presence of the lipofection mix for 5 hours. Following incubation the medium will be 

35 replaced with complete DMEM. Cells will be harvested at different time points post- 
lipofection and protein levels will be analyzed by Westem blot. 
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Antisense molecules should be targeted to cells that express the target gene, either 
directly to the subject in vivo or to cells in culture, such as in ex vivo gene therapy protocols. 
A number of methods have been developed for delivering antisense DNA or RNA to cells; 
e.g., antisense molecules can be injected directly into the tissue site, or modified antisense 

5 molecules, designed to target the desired cells (e.g., antisense linked to peptides or antibodies 
that specifically bind receptors or antigens expressed on the target cell surface) can be 
administered systemically. 

However, it is often difficult to achieve intracellular concentrations of the antisense 
sufficient to suppress translation of endogenous mRNAs. Therefore a preferred approach 

IQ utilizes a recombinant DNA construct in which the antisense oligonucleotide is placed under 
the control of a strong pol III or pol n promoter. The use of such a construct to transfect 
target cells in the patient will result in the transcription of sufficient amounts of single 
stranded RNAs that will form complementary base pairs with the endogenous target gene 
transcripts and thereby prevent translation of the target gene mRNA. For example, a vector 

25 can be introduced e.g. , such that it is taken up by a cell and directs the transcription of an 
antisense RNA. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for repUcation and expression in mammalian 

20 cells. Expression of the sequence encoding the antisense RNA can be by any promoter 
known in the art to act in mammalian, preferably human cells. Such promoters can be 
inducible or constitutive. Such promoters include but are not limited to: the SV40 early 
promoter region (Bemoist and Chambon, 1981, Nature 290, 304-310), the promoter 
contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et ai, 1980, Cell 

25 22, 787-797), the herpes thymidine kinase promoter (Wagner et al, 1981, Proc. Natl. Acad. 
Sci. U.S.A. 78, 1441-1445), the regulatory sequences of the metallothionein gene (Brinster et 
al, 1982, Nature 296, 39-42), etc. Any type of plasmid, cosmid, YAC or viral vector can be 
used to prepare the recombinant DNA construct which can be introduced directly into the 
tissue site. Alternatively, viral vectors can be used that selectively infect the desired tissue, 

3Q in which case administration may be accomplished by another route (e.g. , systemically). 

Ribozyme molecules designed to catalytically cleave target gene mRNA transcripts 
can also be used to prevent translation of target gene mRNA and, therefore, expression of 
target gene product. (See, e.g., PCX International Publication WO90/1 1364, published 
October 4, 1990; Sarver et al, 1990, Science 247, 1222-1225). In an embodiment of the 

35 present invention, oligonucleotides which hybridize to the HSP receptor gene are designed to 
be complementary to the nucleic acids encoding the HSP receptor ligand binding domaiiL 
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Ribozymes are enz3anatic RNA molecules capable of catalyzing the specific cleavage 
of KNA. (For a review, see Rossi, 1994, Current Biology 4, 469-471). The mechanism of 
ribozyme action involves sequence specific hybridization of the ribozyme molecule to 
complementary target RNA, followed by an endonucleolytic cleavage event. The 

5 composition of ribozyme molecules must include one or more sequences complementary to 
the target gene mRNA, and must include the well known catalytic sequence responsible for 
mRNA cleavage. For this sequence, see, U.S. Patent No. 5,093,246, which is 
incorporated herein by reference in its entirety. 

While ribozymes that cleave mRNA at site specific recognition sequences can be 

IQ used to destroy target gene mRNAs, the use of hanmierhead ribozymes is prefenred. 

Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs witibi the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5*-UG-3\ The construction and 
production of hammerhead ribozymes is well known in the art and is described more fiilly in 

J 5 Myers, 1995, Molecular Biology and Biotechnology: A Comprehensive Desk Reference, 
VCH Publishers, New York, (see especially fig. 4, p. 833).and in Haseloff & Gerlach, 1988, 
Nature, 334, 585-591, which is incorporated herein by reference in its entirety. 

Preferably the ribozyme is engineered so that the cleavage recognition site is located 
near the 5' end of the target gene mRNA, i.e., to increase efficiency and minimize the 

20 intracellular accumulation of non-functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one that occurs naturally m Tetrahymena 
thermophila (known as the IVS, or L-19 IVS RNA) and that has been extensively described 
by Thomas Cech and collaborators (Zaug et al, 1984, Science, 224, 574-578; Zaug and 

25 Cech, 1986, Science, 231, 470-475; Zaug et al, 1986, Nature, 324, 429-433; published 
Intemational patent application No. WO 88/04300 by University Patents Inc.; Been & Cech, 
1986, Cell, 47, 207-216). The Cech-type ribozymes have an eight base pair active site which 
hybridizes to a target RNA sequence whereafter cleavage of the target RNA takes place. The 
invention encompasses those Cech-type ribozymes which target eight base-pair active site 

30 sequences that are present in the target gene. 

As in the antisense approach, the ribozymes can be composed of modified 
oligonucleotides (e.g., for improved stability, targeting, etc.) and should be delivered to cells 
that express the target gene in vivo, A preferred method of delivery involves using a DNA 
construct "encoding" the ribozyme under the control of a strong constitutive pol in or pol n 

35 promoter, so that transfected cells will produce sxifficient quantities of the ribozyme to 
destroy endogenous target gene messages and inhibit translation. Because ribozymes unlike 
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antisense molecules, are catalytic, a lower intracellular concentration is required for 
efficiency. 

Endogenous target gene expression can also be reduced by inactivating or "knocking 
out" the target gene or its promoter using targeted homologous recombination (eg^., see 

5 Smithies a/., 1985, Nature 317, 230-234; Thomas & Capecchi, 1987, Cell 51, 503-512; 
Thompson et aL, 1989, Cell 5, 313-321; each of which is incorporated by reference herein in 
its entirety). For example, a mutant, non-functional target gene (or a completely unrelated 
DNA sequence) flanked by DNA homologous to the endogenous target gene (either the 
coding regions or regulatory regions of the target gene) can be used, with or without a 

IQ selectable marker and/or a negative selectable marker, to transfect cells that express the target 
gene in vivo. Insertion of tiie DNA construct, via targeted homologous recombination, 
results in inactivation of the target gene. Such approaches are particularly suited 
modifications to ES (embryonic stem) cells can be used to generate animal offspring with an 
inactive target gene (e.g., see Thomas & Capecchi, 1987 and Thompson, 1989, supra). 

J 5 However this approach can be adapted for use in humans provided the recombinant DNA 
constructs are directly administered or targeted to the required site in vivo usmg appropriate 
viral vectors. 

Alternatively, endogenous target gene expression can be reduced by targeting 
deoxyribonucleotide sequences complementary to the regulatory region of the target gene 

20 (i-e., the target gene promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the target gene in target cells in the body. (See generally, Helene, 1991, 
Anticancer Drug Des., 6(6), 569-584; Helene et al, 1992, Ann. N.Y. Acad. Sci., 660, 27-36; 
andMaher, 1992, Bioassays 14(12), 807-815). 

Nucleic acid molecules to be used in triple helix formation for tiie inhibition of 

25 transcription should be single stranded and composed of deoxyribonucleotides. The base 
composition of these oligonucleotides must be designed to promote triple helix formation via 
Hoogsteen base pairing rules, which generally require sizeable stretches of either purines or 
pyrimidines to be present on one strand of a duplex. Nucleotide sequences may be 
pyrimidine-based, which will result in TAT and CGC** triplets across the three associated 

3Q strands of the resulting triple helix. The pyrimidine-rich molecules provide base 
complementarity to a purine-rich region of a single strand of the duplex in a parallel 
orientation to that strand. In addition, nucleic acid molecules may be chosen that are purine- 
rich, for example, contain a stretch of G residues. These molecxiles will form a triple helix 
with a DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 

35 located on a single strand of the targeted duplex, resulting in GGC triplets across the three 
strands in the triplex. 
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Alternatively, the potential sequences that can be targeted for triple helix formation 
may be increased by creating a so called "switchback" nucleic acid molecule. Switchback 
molecules are synthesized in an alternating 5'-3*, 3'-5* manner, such that they base pair with 
first one strand of a duplex and then the other, eliminating the necessity for a sizeable stretch 

5 of either purines or pyrimidines to be present on one strand of a duplex. 

In instances wherein the antisense, ribozyme, and/or triple helix molecules described 
herein are utilized to inhibit mutant gene expression, it is possible that the technique may so 
efficiently reduce or inhibit the transcription (triple helix) and/or translation (antisense, 
ribozyme) of mRNA produced by normal target gene alleles that the possibility may arise 

^0 wherein the concentration of normal target gene product present may be lower than is 

necessary for a normal phenotype. In such cases, to ensure tiiat substantially normal levels of 
target gene activity are maintained, therefore, nucleic acid molecules that encode and express 
target gene polypeptides exhibiting normal target gene activity may, be introduced into cells 
via gene therapy methods such as those described, below, in Section 5.6.3 that do not contain 

1 5 sequences susceptible to whatever antisense, ribozyme, or triple helix treatments are being 
utilized. Alternatively, in instances whereby the target gene encodes an extracellular protein, 
it may be preferable to co*administer normal target gene protein in order to maintain the 
requisite level of target gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention may 

20 be prepared by any method known in the art for the synthesis of DNA and RNA molecules, 
as discussed above. These include techniques for chemically synthesizing oUgodeoxyri- 
bonucleotides and oligoribonucleotides well known in the art such as for example solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in 
vitro and in vivo transcription of DNA sequences encoding the antisense RNA molecule. 

25 Such DNA sequences may be incorporated into a wide variety of vectors that incorporate 
suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 
Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or 
inducibly, depending on the promoter used, can be introduced stably into cell tines. 

30 5.6.3 GENE REPLACEMENT THERAPY 

With respect to an increase in the level of normal a2M receptor gene expression 
and/or a2M receptor gene product activity, a2M receptor gene nucleic acid sequences can, 
for example, be utilized for the treatment of immune disorders resulting in proliferative 
disorders such as cancer. Such treatment can be administered, for example, in the form of 

35 gene replacement therapy. Specifically, one or more copies of a normal a2M receptor gene 
or a portion of the a2M receptor gene that directs the production of an a2M receptor gene 
product exhibiting normal a2M receptor gene Amotion, may be inserted into the appropriate 
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cells within a patient, using vectors that include, but are not limited to adenovirus, adeno- 
associated virus, and retrovirus vectors, in addition to other particles that introduce DNA into 
cells, such as liposomes. 

Gene replacement therapy techniques should be capable of delivering a2M receptor 
5 gene sequences to cell types that express the HSP receptor within patients. Thus, in one 
embodiment, techniques that are well known to those of skill in the art (see, e.g,, PCT 
Publication No. WO89/10134, published April 25, 1988) can be used to enable a2M receptor 
gene sequences to be delivered to developing cells of the myeloid lineage, for example, to the 
bone manow. In another specific mbodiment, gene replacement can be accomplished using 
macrophages in vitro, and delivered to a patient using the techniques of adoptive 
immunotherapy. 

In another embodiment, techniques for delivery involve direct administration of such 
a2M receptor gene sequences to the site of the cells in which the a2M receptor gene 
sequences are to be expressed, e.g., directly at the site of the tumor. 

15 Additional methods that may be utilized to increase the overall level of a2M receptor 

gene expression and/or a2M receptor gene product activity include the introduction of 
impropriate a2M receptor-expressing cells, preferably autologous cells, into a patient at 
positions and in numbers that are sufficient to ameliorate the symptoms of an a2M receptor 
disorder Such cells may be either recombinant or non-recombinant. 

20 Among the cells that can be administered to increase the overall level of a2M 

receptor gene expression in a patient are cells that normally express the a2M receptor gene. 

Alternatively, cells, preferably autologous cells, can be engineered to express a2M 
receptor gene sequences, and may then be introduced into a patient in positions appropriate 
for the amelioration of the symptoms of an a2M receptor disorder or a proliferative or viral 

25 disease, canc^ and tumorigenesis. Altemately, cells that express an unimpaired a2M 
receptor gene and that are from a MHC matched individual can be utilized, and may include, 
for example, brain cells. The expression of the a2M receptor gene sequences is controlled by 
the appropriate gene regulatory sequences to allow such expression in the necessary cell 
types. Such gene regulatory sequences are well known to the skilled artisan. Such cell-based 

30 gene therapy techniques are well known to those skilled in tihe art, see, e.g., Anderson, U.S. 
Patent No. 5,399,349. 

When the cells to be administered are non-autologous cells, they can be administered 
using well known techniques that prevent a host immune response against the introduced 
cells from developing. For example, the cells may be introduced in an encapsulated form 

35 which, while allowing for an exchange of components with the immediate extracellular 
environment, does not allow the introduced cells to be recognized by the host immune 
system. 
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5.6.4 DELIVERY OF SOLUBLE a2M RECEPTOR POLYPEPTIDES 
Genetically engineered cells that express soluble a2M receptor ECDs or fusion 
proteins, e,g., fusion Ig molecules can be admimstered in vivo where they may function as 
"bioreactors" that deliver a supply of the soluble molecules. Such soluble a2M receptor 
5 polypeptides and fusion proteins, when expressed at appropriate concentrations, should 
neutralize or "mop up*' HSPs or other native ligand for the a2M receptor, and thus act as 
inhibitors of a2M receptor activity and may therefore be used to treat HSP-a2M receptor- 
related disorders and diseases, such as autoimmune disorders, proliferative disorders, and 
infectious diseases. 



5.6.5 DELIVERY OF DOMINANT NEGATIVE MUTANTS 

In another embodiment of the invention, dominant negative mutants ("dominant 
negatives") may be used therapeutically to block the immune response to an HSP-antigen 
complex, e.g., to treat an auto-immune disorder. In general, such dominant-negatives are 

15 mutants which, when expressed, interact with ligand (i.e, HSP-antigenic molecule 
complex), but lack one or more functions, i.e. endocytotic functions and/or signaling 
functions, of normal a2MR. Such mutants interfere with the function of normal a2MR in the 
same cell or in a different cell, e,g. by titration of HSP-peptide complexes fiom the wild type 
receptor. Such a mutation, for example, can be one or more point mutation(s), a deletion, 

20 insertion, or other mutation in either the extracellular of the 5 1 5 kDa subunit, or the 

extracellular, transmembrane or intracellular domains of the 85 kDa subunit of the alpha(2) 
macroglobulin receptor {see Krieger and Hen:, 1994, Annu. Rev. Biochem 63:601-637 for 
a2MR subunit configuration). However, in construction of dominant negative mutations in 
the either subunit, care should be taken to ensure that the cleavage domain (signaling 

25 cleavage between aas 3525 and 3526 of the precursor of a2MR) remains intact so that the 
515 kDa subunit is processed and presented on the cell surface. Additionally, care should be 
taken to ensure that the domains by which the two subunits associate should also remain 
functional. For example, in a specific embodiment, the C-terminal intracellular doniain of 
the 85 kDa subunit is truncated. In another embodiment, a point mutation on the N-terminal 

30 515 kDa subunit blocks endocytosis but not ligand binding. In another embodiment, the N- 
terminal 515 kDa subunit is e^ressed as a fusion protein, wherein the C-terminus of said 
fusion protein is the transmembrane domain and optionally the intracellular domain, of 
another Type I single transmembrane receptor. 



35 by normal functional receptors in the same or neighboring cells by titrating out the amount of 
available ligand. Thus, a recombinant antigen presenting cell expressing such a dominant 



10 



Egression of a such a dominant negative mutation in cell can block uptake of ligand 
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negative can be used to titrate out HSP-antigenic molecule complexes when administered to 
a patient in need of treatment for an autoimmune disorder. 

5.6.6 EXTRACORPOREAL METHODS FOR MODULATING THE IMMUNE 
RESPONSE 

The present invention also relates to methods for modulating an immune response in 
a patient by altering tiie levels a2M receptor Ugand in the bloodstream using extracorporeal 
methods. a2M receptor acts as a heat shock protein receptor in a2M receptor-expressing 
cells, such as macrophages and dendritic cells. Binding of HSPs or HSP antigenic peptide 
complexes to such a2M receptor-expressing cells results in internalization of the HSP and 
the re-presentation of peptides chaperoned by the HSP. However, because a2M receptor has 
a diverse roles in different cell types and binds numerous non-HSP ligands, competition 
between a2M receptor ligands reduces the abiUty of HSPs and HSP complexes to access 
a2M receptor. 

The Applicant has discovered that depleting the blood of non-HSP-a2M receptor 
ligands and transfusing such a2M receptor-ligand-depleted blood into the bloodstream of a 
patient can be used to stimulate the immune response, perhaps by increasing access of HSP 
complexes to the a2M receptor. Altematively, blood can be depleted of a2M receptor 
ligands, including HSPs, followed by the addition of HSPs or HSP antigenic peptide 
complexes to stimulate a specific immune response. Decreasing the levels of a2M receptor 
ligands can be used to enhance a desired immune response in patients, such as patients with 
cancer and infectious disease. Such methods for depletion of a2M receptor ligands to the 
bloodstream are described in detail below. 

In various embodunents, extracorporeal procedures, such as transfusion and 
apheresis, may be used to stimulate an immune response by modulating a2M receptor ligand 
levels in a patient's circulation or altematively, depleting a2M receptor ligands including 
HSPs fi:om the blood, followed by the selective addition of specific HSPs or HSP antigenic 
peptide complexes to the blood. For example, in one embodiment, apheresis techniques 
coupled with affinity column technology, are used to remove a2M receptor ligand firom a 
patients blood, followed by the return the ligand-depleted blood into circulation. 

In another embodiment, apheresis techniques coupled with affinity chromatography 
techniques are used to remove a2M receptor Ugand firom a patient's blood followed by the 
selective addition of HSPs or HSP antigenic peptide complexes to the patient's blood, and 
retum of the treated blood into the patient's circulation. 

Extraction of blood can be performed either manually or by any one of the common 
automated, electronically controlled "apheresis" systems such as the Autopheresis-C.RTM. 
system (Baxter Healthcare Corporation, Fenwal Division, 1425 Lake Cook Road, Deerfield, 
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m. 60015). la a preferred embodiment, a blood separation apparatus is fluidly connected to a 
blood vessel of the patient by way of a blood extraction tube. A blood pump, such as a 
peristaltic pump, is positioned on the blood extraction tube to pump blood from the patient to 
a blood separation apparatus. An anticoagulant, such as heparin, can be added to the blood 

5 through a separate chamber that is in fluid communication with the apheresis system. 

Optionally, blood can be taken out of the apheresis system, treated to remove a a2M 
receptor ligand in the laboratory, and then put back into the apheresis system to be 
reintroduced to the patient. In another embodiment, the blood can be further separated into 
cellular components such that only a specific subset of cells (i.e. leukocytes) can be treated to 

IQ remove an a2M receptor ligand and retumed to the patient or, alternatively, only the plasma 
can be treated to remove an a2M receptor ligand and retumed to the patient. In another 
embodiment, after the blood has been treated to remove an a2M receptor ligand, HSPs are 
added back to the blood. 

In various embodiments, blood from a patient can be withdrawn manually and the 

2 5 cells can be separated by a standard laboratory blood cell collection device. After or during 
the cellular collection, the blood can be treated to remove an a2M receptor ligand. The cells 
can then be retumed to the patient by an i.v. drip or by injection with a syringe. 

In one embodiment, transfiision/apheresis methods may be used to enhance an 
immune response. a2M receptor ligands are removed from transfused blood of a patient in 

20 need of treatment for an inraiune disorder. In another embodiment, the a2M receptor ligand 
that is removed from the blood is not a heat shock protein. 

One example of such a method comprises the following steps: (1) withdrawing blood 
fix>m a patient; (2) passing the patient's blood over an affinity column comprising a a2M 
receptor ligand-binding compound, such as an antibody specific for a a2M receptor ligand, 

25 for a time period and under conditions sufficient to allow binding of a2M receptor ligand to 
the affinity column; (3) retuniing the cx2M receptor-ligand depleted blood to flie patient 

In another embodiment, apheresis methods may be used to enhance an immune 
response by depletmg a2M receptor ligands (including HSPs) followed by the addition of 
selective HSPs or HSP antigenic peptide complexes to the blood of a patient. 

30 An example of such a method comprises the following steps: (1) withdrawing blood 

from a patient; (2) passing the patient's blood over an affinity column comprising a a2M 
receptor-Ugand-binding compound for a time period and imder conditions sufficient to allow 
binding of the a2M receptor ligand to the afiSnity column; (3) adding HSPs or HSP antigenic 
peptide complexes to the ligand depleted blood; (4) returning the blood to the patient. 

35 Methods that can be used to remove a Ugand from the blood include affinity 

chromatography, anion or cation exchange chromatography, phosphocellulose 
chrontiatography, immunoaffinity chromatography, hydroxyapatite chromatography, and 
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lectin chromatography. AfiBnity purification is based on the interaction between the 
compound on the affinity colimm and its binding partner. The principle of affinity 
chromatography is well known in the art. In one embodiment, a recombinantly expressed and 
purified (or partially purified) protein, such as a2M receptor, is covalently or non-covalently 

5 coupled to a solid support such as, for example, a chromatography column. The extracted 
blood firom a patient can be run over such a column. The coupled protein will bind to the 
a2M receptor ligand and deplete the blood of the a2M receptor ligand. The depleted blood 
can then be returned to the patient In another embodiment, an antibody specific to the 
ligand can be coupled to the chromatography column and the immunospecific binding of an 

IQ antibody to the a2M receptor ligand can be used to deplete the blood of the a2M receptor 
ligand. Alternatively, one of the many cation or anion exchange resins commonly used in 
the art can be used to deplete the blood of the a2M receptor ligand. 

In another embodiment, the present invention also includes a kit that comprises a 
solid phase chromatography colmnn with a purified a2M receptor ligand binding molecule 
attached thereto. Such a kit can contain components necessary for extracorporeal removal of 
a2M receptor ligands firom the blood of a patient in need of such treatment. 

Transfiision/apheresis methods may also be used in combination with other methods 
of immunotherapy. In one embodiment, for example, after depletion of non-HSP a2M 
receptor ligands as described above, HSP-antigenic pq)tide complexes may be delivered to a 

20 cancer patient, or a patient having an infectious disease, using the transfiision/apheresis 
methods, or other method. Using transfiision/apheresis, at the same time as HSP-antigenic 
peptide complexes are being delivered, a2M receptor Ugands (other than HSPs) may be 
removed firom the patient's blood, in order to stimulate the immune response against the 
HSP-antigenic peptide complex being delivered. Thus, the transfiision/apheresis method 

25 makes it possible to accomplish both the delivery of HSP-antigenic peptide complexes and 
the removal of competing a2M receptor ligands in a single procedure. 



5.7 TARGET AUTOIMMUNE DISEASES 

30 Autoimmune diseases that can be treated by the methods of the present invention 

include, but are not limited to, insulin dependent diabetes mellitus IDDM, or 
autoimmune diabetes), multiple sclerosis, systemic lupus erythematosus, Sjogren's 
syndrome, scleroderma, polymyositis, chronic active hepatitis, mixed connective tissue 
disease, primary biliaiy cirrhosis, pernicious anemia, autoimmune thyroiditis, idiopathic 

35 Addisorfs disease, vitiligo, gluten-sensitive enteropathy. Graves' disease, myasthenia gravis, 
autoimmune neutropenia, idiopathic thrombocytopenia purpura, rheumatoid arthritis, 
cirrhosis, pemphigus vulgaris, autoimmune infertility, Goodpasture's disease, bullous 
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pemphigoid, discoid lupus, ulcerative colitis, and dense deposit disease. The diseases set 
forth above, as referred to herein, include those exhibited by animal models for such 
diseases, such as, for example non-obese diabetic (NOD) mice for IDDM and experimental 
autoimmune encephalomyelitis (EAE) mice for multiple sclerosis. 
5 The methods of the present invention can be used to treat such autoimmune diseases 

by reducing or eliitninating the immune response to the patient's own (self) tissue, or, 
altematively, by reducing or eliminating a pre-existing autoimmune response directed at 
tissues or organs transplanted to replace self tissues or organs damaged by the autoimmune 
response. 

10 

5.8 TARGET INFECTIOUS DISEASES 

The infectious diseases that can be treated or prevented using the methods and 
compositions of the present invention include those caused by intracellular pathogens such as 
viruses, bacteria, protozoans, and intracellular parasites. Viruses include, but are not limited 

25 to viral diseases such as those caused by hepatitis type B virus, parvoviruses, such as 
adeno-associated virus and cytomegalovirus, papovaviruses such as papilloma virus, 
polyoma viruses, and SV40, adenoviruses, herpes viruses such as herpes simplex type I 
(HSV-I), herpes simplex type n (HSV-II), and Epstein-Barr virus, poxviruses, such as 
variola (smallpox) and vaccinia virus, RNA viruses, including but not limited to human 

20 immunodeficiency virus type I (HIV -I), himian immunodeficiency virus type 11 (HIV -II), 
human T-cell lymphotropic virus type I (HTLV-I), and himian T-cell lymphotropic virus 
type n (HTLV-n); influenza virus, measles virus, rabies virus, Sendai virus, picomaviruses 
such as poliomyelitis virus, coxsackieviruses, rhinoviruses, reoviruses, togaviruses such as 
rubella virus (German measles) and Semhki forest virus, arboviruses, and hepatitis type A 

25 virus. 

In another embodiment, bacterial infections can be treated or prevented such as, but 
not limited to disorders caused by pathogenic bacteria including, but not limited to. 
Streptococcus pyogenes^ Streptococcus pneumoniae^ Neisseria gonorrhoea^ Neisseria 
meningitidis, Corynebacterium diphtheriae, Clostridium botulinum, Clostridium perfringens, 

30 Clostridium tetania Haemophilus influenzae^ Klebsiella pneumoniae, Klebsiella ozaenae, 
Klebsiella rhinoscleromotiSy Staphylococcus aureus^ Vibrio cholerae, Escherichia coli, 
Pseudomonas aeruginosa, Campylobacter (Vibrio) fetus , Campylobacter jejuni, Aeromonas 
hydrophila, Bacillus cereus, Edwardsiella tarda. Yersinia enterocolitica. Yersinia pestis, 
Yersinidi pseudotuberculosis. Shigella dysenteriae. Shigella flexneri. Shigella sonnei, 

35 Salmonella typhiimurium. Salmonella typhii, Treponema pallidum, Treponema pertenue, 
Treponema carateneum, Borrelia vincentii, Borrelia burgdorferi, Leptospira 
icterohemorrhagiae, Mycobacterium tuberculosis. Toxoplasma gondii, Pneumocystis carinii. 
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Francisella tularensis. Brucella abortus. Brucella suis^ Brucella melitensis, Mycoplasma 
spp.y Rickettsia prowazeki, Rickettsia tsutsugumushi. Chlamydia spp.^ ^nd Helicobacter 
pylori. 

In another preferred embodiment, the methods can be used to treat or prevent 
5 infections caused by pathogenic protozoans such as, but not linuted to, Entomoeba 

histolytica. Trichomonas tenas. Trichomonas hominis. Trichomonas vaginalis, Trypanosoma 
gambiense, Trypanosoma rhodesiense, Trypanosoma cruzi, Leishmania donovaniy 
Leishmania tropica, Leishmania braziliensis, Pneumocystis pneumonia, Plasmodium vivax, 
Plasmodium falciparum, and Plasmodium malaria. 

10 

5.9 TARGET PROLff ERATIVE CELL DISORDERS 

With respect to specific proliferative and oncogenic disease associated with HSP- 
a2M receptor activity, the diseases that can be treated or prevented by the methods of the 
present invention include, but are not limited to: human sarcomas and carcinomas, e.g., 
1^ fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, 
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, 
synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon 
carcinoma, pancreatic cancer, breast cancer, ovarian canc^, prostate cancer, squamous cell 
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland 
20 carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 
choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical cancer, testicular 
tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, 
glioma, astrocytoma, meduUoblastoma, craniopharyngioma, ependymoma, pinealoma, 
25 hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma, 
neuroblastoma, retinoblastoma; leukemias, e.g., acute lymphocytic leukemia and acute 
myelocytic leukemia (myeloblastic, promyelocytic, myelomonocytic, monocytic and 
erythroleukemia); chronic leukemia (chronic myelocytic (granulocytic) leukemia and chronic 
lymphocytic leukemia); and polycythemia vera, lymphoma (Hodgkin's disease and non- 
30 Hodgkin*s disease), multiple myeloma, Waldenstrom's macroglobuliuCTua, and heavy chain 
disease. 

Diseases and disorders involving a deficiency in cell proliferation or in which cell 
proliferation is desired for treatment or prevention, and that can be treated or prevented by 
inhibiting the a2M receptor Amotion, include but are not limited to degenerative disorders, 
3S growth deficiencies, hypoproliferative disorders, physical trauma, lesions, and wounds; for 
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example, to promote wound healing, or to promote regeneration in degenerated, lesioned or 
injured tissues, etc. 

5.10 PHARMACEUTICAL PREPARATIONS AND METHODS OF 
ADMINISTRATION 

5 

The compounds that are determined to affect a2M receptor gene expression or gene 
product activity can be administered to a patient at ther^eutically effective doses to treat or 
ameliorate a cell proliferative disorder. A flierapeutically effective dose refers to that amount 
of the compound sufficient to result in ameUoration of symptoms of such a disorder. 

10 

5.10.1 EFFECTIVE DOSE 

Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procediures in cell cultures or experimental animals, e.g., for determining the 
LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically 

J 5 effective in 50% of the population). The dose ratio between toxic and therapeutic effects is 
the therapeutic index and it can be expressed as the ratio LD5/BD50. Compounds that exhibit 
large therapeutic indices are preferred. While compounds that exhibit toxic side effects may 
be used, care should be taken to design a delivery system that targets such compounds to the 
site of affected tissue in order to minimize potential damage to uninfected cells and, thereby, 

20 reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably witbin a range of circulating concmtrations that include the ED50 with little or no 
toxicity. The dosage may vary within this range depending upon the dosage form employed 

25 and the route of administration utilized. For any compound used in the method of the 
inv^tion, the therapeutically effective dose can be estimated initially firom cell culture 
assays. A dose may be formulated in animal models to achieve a circulating plasma 
concentration range that includes the IC50 (ie., the concentration of the test compound that 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such 

30 information can be used to more accurately determine useful doses in humans. Levels in 
plasma may be measured, for example, by high performance Uquid chromatography. 



35 
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5.10.2 FORMULATIONS AND USE 

Pharmaceutical compositions for use in accordance with the present invention may be 
formulated in conventional manner using one or more physiologically acceptable carriers or 
excipients. 

5 Thus, the compounds and their physiologically acceptable salts and solvates may be 

formulated for administration by inhalation or insufQation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

For oral administration, the pharmaceutical compositions may take the form of, for 
example, tablets or capsules prepared by conventional means with pharmaceutically 

IQ acceptable excipients such as binding agents (e^., pregelatinised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g,, lactose, 
microciystalline cellulose or calcium hydrogen phosphate); lubricants (eg., magnesium 
stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch glycolate); or 
wetting agents (e:g., sodium lauryl sulphate). The tablets may be coated by methods well 

15 known in the art Liquid preparations for oral administration may take tiie form of, for 
example, solutions, syrups or suspensions, or they may be presented as a dry product for 
constitution with water or other suitable vehicle before use. Such liquid preparations may be 
prepared by conventional means with pharmaceutically acceptable additives such as 
suspending agents {e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 

20 emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., methyl or propyl- 
p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer salts, 
flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give controlled 

25 release of the active compound. 

For buccal administration the compositions may take the form of tablets or lozmges 
formulated in conventional maimer. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation tcom 

3Q pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 

dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 
or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of e.g., gelatin for 
use in an inhaler or insufflator may be formulated containing a powder mix of the compound 

35 and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., by 
bolus injection or continuous infusion. Formulations for injection may be presented in unit 
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dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may be in powder form for 
5 constitution with a suitable vehicle, e.g, , sterile pyiogen-firee water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories 
or retention enemas, e.g,, containing conventional suppository bases such as cocoa butter or 
other glycerides. 

In addition to the formulations described previously, the compounds may also be 
IQ formulated as a depot preparation Such long acting formulations may be adnunistered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange 
resins, or as sparingly soluble derivatives, for exan^le, as a sparingly soluble salt. 
J 5 The compositions may, if desired, be presented in a pack or dispenser device that may 

contain one or more unit dosage forms containing the active iagredient. The pack may for 
example comprise metal or plastic foil, such as a blister pack. The pack or dispenser device 
may be accompanied by instructions for administration. 

20 

6. EXAMPLE: IDENTIFICATION OF a2M RECEPTOR AS AN HSP RECEPTOR 

6.1 INTRODUCTION 

The Example presented herein describes the successful identification of an interaction 
25 between gp96, hsp90, hsp70, and calreticulin with the a2M receptor present in macrophages 
and dendritic cells. The experiments presented herein form the basis for isolating a2M 
receptor polypeptides and for the screening, diagnostic, and therapeutic methods of the 
present invention. 

The Applicant of the present invention noted that certain observations were 
30 inconsistent with a "direct transfer" model of HSP-chaperoned peptide antigen presentation. 
First, the immimogenicity of HSP prq)arations is dependent on the presence of functional 
phagocytic cells but not B cells or oth^ nonprofessional antigen-presenting cells, (Udono 
and Srivastava, 1993, supra; Suto and Srivastava, 1995, supra), whereas free peptides can 
sensitize all cell types. Second, extremely small quantities of HSP-peptide complexes were 
35 effective in eliciting specific immunity, ie., gp96-chaperoned peptides are several hundred 
times as effective as free pq)tides in sensitizing macrophages for CTL recognition, 
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suggesting the possibility of a specijfic uptake mechanism. Third, gp96-chaperoned peptides 
elicited an MHC I response that was not limited by the size of peptide. Finally, the 
processing of gp96-peptide complexes in macrophage was found to be sensitive to Brefeldin 
A (BFA), which blocks transport through the Golgi apparatus, suggesting that processing 

5 occurred through an intercellular mechanism. These observations led to the hypothesis that 
HSP-chaperoned peptides may be processed intemally and re-presented by MHC class I 
molecules on fhe cell surfaces of macrophages (Suto and Srivastava, 1995, supra). There is 
also the hypothesis that the maimose receptor is used in the uptake of gp96 but no 
mechanism has been proposed for the non-glycosylated HSPs, such as HSP70 (Ciupitu et al, 

IQ 1998, J. Exp. Med., 187: 685-691). Others suggested that a novel intracellular trafficking 
pathway may be involved for the transport of peptides from the extracellular medium into the 
lumen of ER pay et al, 1997, Ptoc. Natl. Acad. Sci. 94:8065-8069; Nicchitta, 1998, Cuir. 
Opin. in LnmunoL 10: 103-109). Further suggestions include the involvement of phagocytes 
which (a) possess an ill-defined pathway to shunt protein from the phagosome into the 

15 cytosol where it would enter the normal class I pathway; (b) digest ingested material ha 
lysosomes and regurgitate peptides for loading on the -surface to class I molecules (Bevan, 
1995, J. Exp. Med. 192:639-41). The discovery of a receptor for heat shock proteins as 
disclosed herein helps to resolve the paradox of how extracellular antigenic peptides 
complexed to HSPs can be presented by MHC class I molecules on antigen presenting cells. 

20 

6.2 MATERIALS AND METHODS 

Mice, cells, and reagents. C57B1/6, BALB/c and TAP(-/-) mice were obtained from 
Jackson laboratories. Bone marrow-derived DCs were generated from the femurs and tibia of 
C57BL/6 mice. The bone marrow was flushed out and the leukocytes obtained and cultured 

25 as described (Lutz et a/.,1999, J. Immunol. Methods 223:77-92) m complete RPMI1640 with 
10% heat inactivated FCS and 20ng/ml GMCSF (Bndogen Inc., Wobum, MA) for 6 days. 
On day 3 fresh media with GMCSF was added to the plates for the day 6 cultures. 
Macrophages were obtained from PEMs of pristaned mice by positive selection for CDl lb+ 
cells (Miltenyi Biotech Inc.). RAW264.7 was gift of Dr. Christopher Nicchitta. A20.25 was 

30 gift of Dr. Lawrence Kwak. All other cell lines were obtained from ATCC. Proteasome 
inhibitor Lactacystin was purchased from Kamiya Inc. Japan. Anti-CD91 antibody (clone 
5A6) was purchased from PRAGEN (Heidelberg). Anti-hsp70 (clone N27F3) and anti-PDI 
(clone 1D3) antibodies were purchased from StressGen (Victoria, Canada). 

Purification of HSPs. HSPs were purified as described (Srivastava, P.K., 1997, 

35 Methods: A companion to Methods in Enzymology 12:165-171; Basu and Srivastava, 1999, 
J. Exp. Med. 189(5):797-802). All buffers used for purifications were prepared with 
endotoxin free water (Nanopure Infijoity UV/UF, Bamstead/Thermolyne, Dubuque, lA) and 
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all glasswares used for purification were cleaned with endotoxin fi-ee water and baked in a 
4000F oven (Gruenberg, Wlliamsport, PA). The HSP-containing fractions were identified by 
inmiunoblots. 

Conjugation of proteins to FITC and staining of cells. Purified proteins were 
5 conjugated to FITC using the FluoroTag FITC conjugation kits (SIGMA) as per the 

manufacturers protocol. Conjugation was confirmed by a 2kDa increase in molecular weight 
by SDS-PAGE and by immunoblotting with an anti-FITC monoclonal antibody. Incubations 
. of indicated amounts of FITC-tagged proteins and cells were done in file presence of 1% 
nonfat dry milk (Carnation®) in PBS for 20min at 4**C. After repeated washing, cells were 
10 analyzed by flow cytometry (Becton Dickenson, La Jolla, Califionia). Cells were also labeled 
with propidiiun iodide just before FACScan analysis. Cells staining positive for propidium 
iodide were gated out of the events. No diflFerences were obsCTved in the binding of HSPs to 
Mac-1+ cells from pristaned or non-pristaned mice. Fixed or unfixed cells were labeled with 
FirC-tagged HSP as above. Labeled cells were visualized using a Zeiss LSM confocal 
1^ microscope. 

Affinity chromatography. Proteins (Img) in 2ml volume were incubated with 2ml of 
equilibrated AminoLink beads in PBS with a reductant (NaCNBHj) for 1 hour. Uncoupled 
protein was removed by extensive washing of the column and unreactive groups quenched. 
Immobilization yields were typically >92% of the starting amount of protein. Columns were 

2Q stored at 4°C until used. Such columns were made with gp96 (purified as described in 
Srivastavaera/., 1986, Proc. Nati. Acad. Sci., U.S.A. 83:3407-3411) and albumin. For 
membrane purification, cells were lysed by dounce homogenization in hypotonic buffer 
containing PMSF. Unlyzed cells and nuclei were removed by centrifiigation at lOOOg for 5 
mm. The postnuclear supernatant was centrifiiged at lOO.OOOg for 90 mins. The pellet 

25 contains total membranes and was fractionated by aqueous two-phase partition with a 
dextran/polyethylene glycol biphase. Briefly membranes were resuspended in PEG (33% 
wt/wt m 0.22 M sodium phosphate buflFer, pH 6.5) and underlaid gently wifli dextran 
(20%wt/wt in 0.22M sodium phosphate buflF^, pH 6.5). The two phases were mixed gentty 
and centrifiiged at 2000 g for 15 mins. The white material at the interphase was enriched for 

30 plasma membranes, whose proteins were extracted by 2 hr incubation in 20mM Tris buffer 
(pH8, containing 0.08% octylglucoside) at 4°C. 

Photo cross-linking ofgp96 to putative receptor. The cross-linker (SASD, (Pierce) 
was labeled with I^^' using iodobeads (Pierce). Radiolabeled SASD was covalently attached 
to gp96 by incubation at room temperature for 1 hr. Free SASD and I^^ were removed by 

35 size exclusion column (KwikSep columns. Pierce). For cross-linking studies, I*"-S ASD- 
gp96 (50 }ig gp96) was incubated with purified CDl Ib"^ cells. Unbound protem was 
removed by washing. All procedures to this point -were carried out in very dim light. 
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Proteins were cross-linked with UV light. Cells were lysed with lysis buffer (0.5%NP4O, 
lOmM Tris, ImMEDTA, 150mM NaCl) and treated with 100 mM 2-mercaptoethanol to 
cleave the cross-linker. Cell lysates were analyzed by SDS-PAGE and autoradiography. 
Re-presentation assays. Re-presentation assays were carried out as described (Suto 

5 and Srivastava, 1995, Science 269:1585-1588). Antigen presenting cells (RAW264.7 
macrophage cell line) were plated at a 1 :1 ratio with AH I -specific T cells in complete 
RPMI. Approximately 10,000 cells of each type were used. Gp96 (10 jig/ml) chaperoning 
the AHl-20 mer peptide (RVTY HSPSYVYHOFE RRAK^ was added to the cells and the 
entire culture was incubated for 20 his. Stimulation of T cells was measured by quantifying 
the amount of IFN-Y released into the supematants by ELIS A (Endogen). In addition, 
CDl lb+ peritoneal exudate cells (1X104) were pulsed with HSPs purified from liver, or 
HSP-peptide con^lex generated in vitro and relevant Q)8+ T cells (VSV8 specific CTL line 
or AHl-specific CTL clones, as indicated) were added to the cultures. The assay was carried 
out in 250 ml volume in 96*well plates with RPMI medium containing 5% PCS at 370C for 
5 20 hours. Culture supematants were harvested and tested for the presence of IFN-7 release by 
ELISA (Endogen Inc., Wobum, MA). 

Complexing in vitro of peptide to HSPs. HSPs were mixed with VSV19 or AHl-19 
in a 50 : 1 peptide to protein molar ratio in 0.7M NaCl in Na - phosphate buffer and heated at 
500 C for 10 min., then incubated at room temperature for 30 min. Excess free peptide was 

20 removed with PBS using centricon 10 ( Amicon, Inc., Beverly MA). 

Purification of CDllb-^ cells. CDllb+ cells were selected using the MACS columns 
and protocols supplied by Miltenyi Biotec Inc. Auburn, California. CDl lb antibody, 
suppUed as CDl lb MicroBeads, was purchased from Miltenyi Biotec Inc., and has been 
demonstrated not to activate CDl lb+ cells with regard to the mark^s tested in this 

25 experiment. 

Induction of cytotoxic T cells. C57BL/6 mice were immimized intraperitoneally with 
50 mg of gp96 complexed with VSV19 peptide. Ten days later, recipient spleens were 
removed and splmocytes were stimulated with VSV8 synthetic peptide at ImM 
concentration. After S days, MLTCs were tested for cytotoxicity in a chromium release assay 

3Q using EL4 cells alone and EL4 cells pulsed with VSV8 peptide as targets. 

Protein Microsequencing. Proteins identified by afSnity chromatography were 
analyzed on SDS-PAGE and stained with coomasie blue or transferred onto PVDF 
membrane and stained with coomasie blue, all of it under keratin-free conditions. Protein 
bands were excised with a razor from the gel or membrane. Tryptic peptides from an 80kDa 

35 faint coomassie band were extracted by 50% acetonitrile, 5% formic acid, dried, and loaded 
onto a 75 m 10 cm, reverse-phase C18, microc^illary column (3 jil vol) and tryptic peptides 
were separated by on-line microcapillaiy liquid chromatography-tendem mass spectrometry 
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followed by database searching using the SEQUEST program as previously described. 
(Gatlin et a/.,2000. Anal. Chem. 72:757-63; Link et al, 1999, Nat Biotechnol 17:676-82). 
The analysis was carried out in a data-depend^t auto-MS/MS fashion using a Finnigan LCQ 
iontrap Mass Spectrometer. 

5 

6.3 RESULTS 

Identification of an 80 kDa protein as a potential gp96 realtor. Homogenous 
preparations of gp96 were coupled to FTTC and the gp96-FITC was used to stain RAW264.7 
cells, shown to be functionally capable of re-presenting gp96-chaperoned peptides. Gp96- 

10 FITC but not control albumin-FITC preparations stained the cell surface of RAW264.7 cells 
(FIG. 1 A). Plasma membrane preparations of cell surface-biotinylated RAW264.7 cells were 
solubilized in 0.08% octyl-glucoside and ttie soluble extract was applied to a gp96-Sepharose 
column. Theboundproteins were eluted with 3M sodium chloride. SDS-PAGE analysis of 
the eluate showed 2 major bands of -75-80 kDa size (FIG. IB, top left). Blotting of this gel 

15 with avidin-peroxidase showed that both bands were biotinylated, indicating their surface 
localization (FIG. IB, bottom left). Affinity purification of membrane extracts of RAW264.7 
cells over control serum albumin affinity columns did not result in isolation of any proteins, 
nor did probing of immunoblots of such gels with avidin pCTOxidase detect any albumin- 
binding surface proteins (FIG. IB, top and bottom center lanes). As an additional control, 

20 chromatography of membrane extracts of P815 cells which do not bind gp96-FITC and 

which do not re-present gp96-chaperoned peptides, on gp96 affinity columns did not result in 
elution of any gp96-binding proteins (FIG. IB, top and bottom right lanes). 

In parallel experiments, gp96 molecules were coupled to the radio-iodinated linker 
sutfosuccinimidyl (4-azidosalicylamido) hexanoate (SASD) which contains a photo cross- 

25 linkable group. Gp96-S ASD-I^^ was pulsed onto peritoneal macrophages, which have been 
shown previously to re-present gp96-chaperoned peptides (Suto and Srivastava, 1995, 
Science 269: 1 S85-1 588). Excess gp96-SASD was removed by multiple rounds of washing 
of the cells and photoactivation was carried out by exposure of cells to UV Ught for 10 min. 
Cell lysates were reduced in ord^ to transfer the f ^ group to flie putative gp96 ligand and 

3Q were analyzed by SDS- PAGE followed by autoradiography. The gp96 molecule was 
observed to cross-link to an ~80 kDa band specifically present in re-presentation-competent 
macrophage but not in the re-presentation-incompetent P815 cells (FIG. IC). This band 
appears to correspond in size to the larger of the two bands seen in eluates of gp96 afiSnity 
columns (FIG. IB). No band corresponding to the lower band in that preparation is seen in 

35 the photo cross-linked preparation. The observation ofa specific binding ofgp96 to an 80 
kDa protein in two different re-presentation-competent cell types, but not in a re- 
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presentation-incompetent cell line, and by two independent assays supported the candidacy 
of the 80 kDa molecule for the gp96 receptor. 

Antiserum against the 80 kDa protein inhibits re-presentation of a gp96-chaperoned 
antigenic peptide. The eluates containing the 75-80 kDa proteins were used to immunize a 

5 New Zealand white rabbit, and pre-immune and immune sera were used to probe blots of 
plasma membrane extracts of the re-presentation-competent RAW264.7 and primary 
peritoneal macrophages and the re-presentation-incompetent P81S cells. The immune but not 
the pre-immune serum detected the 80 kDa band (and a faint lower 75 kDa band) in plasma 
membrane extracts of primary macrophage and the RAW264.7 membranes but not of P815 
cells (FIG. 2A). The pre-immune and immime sera were tested in a functional assay for their 
ability to block re-presentation of gp96-cIuq)eroned peptides. The L^-restricted epitope AHl 
derived fiom the gp70 antigen of murine colon carcinoma CT26 (Huang et al^ 1996, Proc. 
Natl. Acad. Sci. U.S.A. 93:9730-9735) was used as the model system. Complexes of gp96 
with an AHl precursor (used to inhibit direct presentation) were pulsed onto RAW264.7 

15 cells which were used to stimulate a L*VAHl-specific CD8+ T cell clone. Release of 
interferon-y by the clones was measured as a marker of their activation. RAW264.7 cells 
were able to re-present gp96-chaperoned AHl precursor effectively in this assay. It was 
observed that at the highest concentration, the immune sera inhibited re-presentation 
completely (FIG. 2B). Although the pre-immune serum was ineffective in inhibiting re- 

20 presentation as compared to the immune sera, it did inhibit re-presentation significantly at 
higher concentrations. The significance of this observation became clear later when we 
determined the identity of the gp96 receptor. Repeated immunizations with the affinity- 
purified gp96-binding proteins did not result in corresponding increase in antibody titers. 

Identification of the 80 kDa protein as an amino terminal fragment of the heavy chain 

25 of the a2M receptor. The 80 kDa protem eluted from the gp96 affinity column was resolved 
on SDS-PAGE and visualized by staining with Coomassie Brilliant Blue. The protein band 
was subjected to in-gel trypsin digestion and mass spectrometry-based protein 
microsequencing as described in the methods in Section 6.2. Four independent tryptic 
peptides corresponding to N-terminal region of the a 2-macroglobulin (a2M) receptor, 

3Q designated by immunologists as CD91, were identified (FIG. 3C). 

a2M inhibits re-presentation of a gp96'Chaperoned antigenic peptide by RA W264, 7. 
a2M receptor is one of the known natural ligands for the a2M receptor. Its ability to inhibit 
re-presentation of gp96-chaperoned antigenic peptide AHl was tested in the assay described 
in FIG. 2. a2M but not control proteins selectin (CD62) or serum albumin was observed to 

35 inhibit re-presentation completely and titratably (FIG. 4). This observation was also 
consistent with the result in FIG. 2 that while the pre-immune serum did not detect an 80 
kDa band in plasma membranes of RAW264.7 cells^ it did inhibit re-presentation to some 
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degree at high concentrations. Thus, by structural as well as functional criteria, the a2M 
receptor was determined to fulfill the criteria essential for a receptor for gp96. 

Binding of fluorescence-labeled HSPs and czj-macroglobulin to a panel of primary 
and cultured cells. FITC-labeled HSPs, gp96, hsp90 or hsp70, or control non-HSP serum 

5 albumin (S A) were incubated with primary cells such as pristane-induced peritoneal 
macrophage, differentiated bone marrow-derived dendritic cells or with immortalized cell 
lines such as RAW264.7, RAW309Cr.l of macrophage origin, P815 mastocytoma, YAC-1 
lymphoma, EL4 thymoma, Meth A and PS-C3H fibrosarcomas, B16 melanoma, CT26 colon 
carcinoma, and UV6139 squamous cell carcinoma, as described in the Methods. After 

10 removal of unbound protein by extensive washing, cells were analyzed by flow cytometry. 
As shown in Figure 5, the peritoneal macrophages and the bone marrow-derived dendritic 
cells showed robust binding of each of the three HSPs but not albumin. However, of the two 
macrophage cell lines, only one of them, RAW264.7, bound the three HSPs. 
RAW309Cr.l.did not bind any of the HSPs (FIG. 6Aand6B). Out of 8 other cell lines tested 

15 with the FITC-labeled gp96, hsp90 and hsp70, none was observed to bind to HSP in a 
manner comparable to the binding observed with RAW264.7. YAC 1 was observed to bind 
hsp70 but only to a significantly smaller degree. The binding was only a firaction of that 
observed with APCs. 

As described above, the a2 macroglobulin receptor has been identified as the receptor 

20 for gp96. All of the cell types in Figure 5 were also tested for the presence of CD91 by 
staining with FITC-a2 macroglobulin. CD91 showed precisely the same pattern of 
distribution as did each of the three HSPs (FIG. 5). 

The ability of cells to bind HSPs and correlates with the ability to re-present 
gp96'Chaperoned peptides. We tested if the ability of a particular cell type to bind HSPs or 

25 macroglobulin as shown in Figure 5 correlates with its ability to re-present gp96- 

chaperoned peptides. Re-presentation studies are done typically by incubating APCs and an 
HSP, chaperoning a known peptide, with T cells specific for an epitope present in the 
chaperoned peptide (Suto and Srivastava,199S, supra). The experimental system is set up 
such that the peptide cannot charge directly onto MHC I but requires intracellular processing 

30 followed by presentation to T cells. VSV8 and AHl antigenic systems were used in these 
studies. The VSV8 epitope (RGYVYQGL) is presented by the K** molecule and VSV19 
(SLSDL RGYVYOGLK SGNVS^ is its extended variant, which cannot charge K** directly. 
AHl (SPSYVYHQF) is an L*^-restricted epitope of a murine leukaemia virus envelope 
protein gp70 (Huang et al.,1996), and AHl-19 fRVTY HSPSYVYHOFE RRAK^ is its 

3^ extended version. Peritoneal macrophage and BM-DCs were tested side-by-side for re- 
presentation in the VSV8 system, and both cell types were able to re-present gp96- 
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chaperoned VSV19 to VSV8-specific T ceUs (FIG. 7A). EL4 and B16 cells, both of the b 
haplotype, were also tested and were found unable to re-present in identical assays (data not 
shown). The BM-DCs were observed to re-present gp96-chaperoned VSV19 significantly 
better than macrophage did; however, it is not possible to determine fi'om the data if this 

5 difiference derives firom the better T cell stimulatory properties of DCs in general or whether 
the DCs are specifically more efficient than macrophage at re-presenting gp96-chaperoned 
peptides. The two macrophage cell lines RAW309Cr. 1 and RAW264.7 were tested for their 
re-presentation ability in the AHl system. In parallel with the HSP and a2M-staimng data 
(FIG. 5), RAW264.7 cells but not RAW309Cr.l were observed to be capable of re- 

10 presenting gp96-chaperoned AHl peptides (Fig. 7B). 

Peptides chaperoned by hsp90, hsp70 and CRT are re-presented by MHC I molecules 
ofAPCs. Gp96 was the first HSP for which the re-presentation phenomenon was 
experimentally shown (Suto and Srivastava 1995, supra). Hsp70-ch£q)eroned peptides have 
been shown recently to be re-presented by APCs (Castellino et al, 2000, J.Exp Med. 

15 191(11):1957-1964). The ability ofotherHSPs,hsp90 and CRT to introduce chaperoned 
peptides into the endogenous presentation pathway was tested in the AHl system with 
RAW264.7 cells as the APCs. RAW264.7 cells were pulsed with hsp90, hsp70, cahreticulin, 
or gp96, as a positive control, by themselves, or chaperoning the AHl-19 peptide. 
Chaperonmg of peptides by the HSPs was accomplished in vitro as previously described 

20 (Blachere et al. 1997, J.Exp. Med. 186:1315-1322; Basu and Srivastava 1999, J. Exp. 
Med. 189:797-802). T cells specific for L^/AH-1 secreted IFN-y when the RAW264.7 cells 
were pulsed with complexes of hsp90, hsp70, CRT or gp96 with AHl-19, but not when the 
HSPs were not complexed with the peptide (FIG. 8). Pulsing of RAW264.7 cells with AHl- 
19 alone did not lead to surface loading of L*^ molecules and consequent stimulation of T 

25 cells. Further, RAW264.7 cells pulsed with complexes of serum albumin with AHl-19, also 
failed to stimulate L^/AHl-specific T cells, thus indicating the specific requirement of HSPs 
for introducing the chaperoned peptides into the endogenous presentation pathway (FIG. 8). 

Gp96, hsp90, hsp70 and CRT engage a common receptor. Does each HSP have a 
unique receptor or do they share a common receptor? This question was addressed by three 

30 independent criteria : by measuring re-presentation of gp96-chapen)ned AHl-19 (as in FIGS. 
7 and 8) in the presence of excess and titrated quantities of fi:ee (i.e. not complexed to AHl- 
19) gp96, hsp90, hsp70 or serum albumin, by testing if macroglobulin, a known ligand for 
CD91, a receptor for gp96, can inhibit re-presentation of peptides chaperoned by gp96, 
hsp90, hsp70 or CRT, and finally, if anti-CD91 antibody can inhibit re-presentation of 

35 peptides chq)eroned by some or all the HSPs. 
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The gp96-AHl-19 complex was added to RAW264.7 cultures at a fixed final 
concentration of 40 |ig/ml, while the competing HSPs or serum albumin were added at 
concentrations between (200-800) jig/ml. It was observed (FIG. 9A) that all 3 competing 
HSPs could inhibit re-presentation of gp96-chaperoned AHl-19, albeit with different 

5 eflBciencies. Gp96 was able to compete with itself, while hsp90 was an even better 

competitor than gp96. Hsp70 was a less efficient competitor than gp96 but was a significant 
competitor. Albumin competed inefficiently. In quantitative terms, approximately 2 fold 
molar excess of hsp90, 6 fold molar excess of gp96, and a 13 fold molar excess of hsp70 
were required to inhibit by 50% the re-presentation of gp96-chaperoned peptides at a gp96 

10 concentration of 40 p.g/ml. All three HSPs were able to inhibit the re-presentation of gp96- 
chaperoned peptides completely at tiie highest concentration tested. This observation 
suggests that gp96, hsp90 and hsp70 utilize a single receptor albeit with differing 
specificities. 

In additional experiments, increasing quantities of macroglobulin were added to 
15 re-presentation assays where AHl-19 chaperoned by gp96, hsp90, hsp70 or CRT was re- 
presented by RAW264.7 cells, to L**/AH-1 specific T cells. macroglobulin was observed 
to inhibit, in a titratable manner, re-presentation of peptides chq)eroned by each of the four 
HSPs (FIG. 9B). Re-presentaion of peptides chaperoned by gp96, hsp70 and CRT was 
inhibited eqiially, while re-presentation of hsp90-chaperoned peptide was inhibited the most 
effectively, and almost completely at the highest concentration of macroglobulin tested. 
Serum albumin, when tested at the highest concentration, inhibited re-presentation only 
modestly. It may be noted that while the data in Fig. 9A show that the specific peptide- 
deficient HSPs inhibited re-presentation of gp96-AHl-19 complexes completely at the 
highest concentrations tested, macroglobuUn appears far less effective, in quantitative 

25 

terms, at inhibiting the re-presentation of peptides chaperoned by 3 of the 4 HSPs (FIG. 9B). 
However, this quantitative disparity disappears if one notes that the macroglobulin 
molecule is approximately 10 times larger in molecular mass than the average HSP molecule. 

A mouse monoclonal anti-CD91 IgG, antibody and isotype control antibodies were 
tested for their ability to inhibit re-presentation of peptides chaperoned by gp96, hsp90, 

30 

hsp70 and CRT. As before, the RAW264.7/AH1 system was utilized and the antibodies were 
added to re-presentation cultures at the concentrations indicated (Fig. 9C). Anti-CD91 
antibody was observed to inhibit, titratably and specifically, the re-presentation of AHl 
chaperoned by each of the 4 HSPs tested. The isotype control antibody did not inhibit re- 
presentation in any instance. Further, the inhibition mediated by the anti-CD91 antibody was 

35 

complete and uniform for each of the HSPs, radicating that CD91 is the sole receptor for 
eachofthe4HSPs. 
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Requirement of a functional proteasome complex for the representation ofgp96' 
chaperoned peptides by APCs. The re-presentation assay was carried out in presence or 
absence of the specific proteasome inhibitor, lactacystin. The peritoneal macrophages were 
treated or untreated with lactacystin for 2 hr and then cultured with gp96-VSV19 complex 

5 for anotiier 2 hr in presence or absence of the inhibitor. The cells w^e chromium labeled at 
the same time for 1 hr and &en washed and used as targets against CDS"^ cells specific for 
VSV8 in a 4 hr chromium release assay. Gp96-VSV19, lactacystin-untreated pulsed APCs 
were lyzed by VSV8-specific CDS"*" T cells (FIG. lOA). As observed previously for gp96 
(Suto and Srivastava 1995, supra) and for hsp70 (CastelUno et aL, 2000, supra), only a mall 

IQ proportion of pulsed APCs were lyzed by tiie APCs even at the highest E:T ratio tested (FIG. 
lOA), The APCs pulsed with VSV8 (througji surface charging) were lyzed in a titratable and 
more significant degree, indicating that the APCs were entirely capable of being lyzed. The 
basis of the selective lyzability of APCs re-presenting HSP-chaperoned peptides is still 
unclear. However, and regardless of this observation, the lactacystin-treated, gp96-VSV19 

25 pulsed APCs were not recognized by the VSVS-specific CD8^ T cells and were not lyzed by 
them (FIG. IDA). Inhibition of proteasomal function thus inhibits the processing of peptides 
chaperoned by gp96 (FIG. IDA). As other HSPs tested also use the same portal of entry (FIG. 
9), it is assumed that inhibition of proteasome function interferes with processing of peptides 
chaperoned by them as well. The data recently reported by Castellino et al for hsp70 are 

20 consistent with this inference. 

Re-presentation of gp96-chaperoned peptides by MHC I of the APCs requires a 
functional TAP. The requirement of TAP in re-presentation of gp96 chaperoned peptides by 
APCs was tested. In a re-presentation assay in vitro, gp96 purified fix>m liver or the same 
gp96 complexed with VSVl 9 was pulsed on to primary cultures of peritoneal macrophages 

25 derived from TAP +/+ or -/- mice. The pulsed APCs were used to stimulate CD8'^ T cell lines 
specific for KWS V8. After incubation for 20 hr, the culture supomatants were tested for 
release of IFN-y as a marker for T cell stimulation (FIG.lOB). It was observed that APCs 
firom TAP+/+ mice stimulated the CD8'*' T cells specifically when cultured in presence of 
gp96 complexed to VSVl 9 but APCs firom TAPl-A mice were unable to do so. This result 

30 indicates that gp96-chaperoned peptides must enter the endoplasmic reticulum through the 
TAP molecules, for being loaded on the MHC I molecules. As other HSPs tested also use the 
same portal of entry (FIG, 9), it is assumed that peptides chaperoned by other HSPs also 
require TAP for re-presentation. Part of the data recently reported by Castellino et al for 
hsp70 are consistent with this inference. 

35 hi studies in vivo, TAP1(-/-) (C57BL/6/SV129J) or wild type (C57BL/6) mice were 

immunized with the gp96-VSV19 complexes (50 |ig of gp96 complexed with 50 \ig of 
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VSV19), or VSV19 alone, or gp96 alone. Spleen cells of unmunized mice were cultured with 
the VSV8 and tested for cytotoxic activity on ^'Cr labeled EL4 cells or EL4 cells pulsed with 
the VSV8 peptide as targets. Spleen cells of wild type (C57BL/6) mice immunized with 
gp96-peptide complex showed VSV8-specific CTL activity whereas splenocytes of TAP! (- 

5 A) mice immunized with gp96-peptide complex showed no cytotoxic activity (FIG. 1 OC). It 
may be argued that the lack of CTL activity m TAP-/- mice is a result of the poor loading 
and stabiUty of MHC I molecules in general^ rather than because of a specific block in re- 
presentation. While this is possible, and is difficult to entirely refute, we are easily able to 
generate VSVS-specific CTLs in TAP-/- mice as in TAP+/+ mice by immunization with 

IQ VSV8 peptide in incomplete Freund's adjuvant (data not shown). Sandberg et al. (1996) have 
reported similar data. In any case, the data from re-presentation assays in vitro using APCs 
from TAP+/+ and -/- mice (FIG. lOB) demonstrate the TAP requirement for re-presentation 
convincmgly and without the complexity introduced by the experiment in vivo (FIG. IOC). 

15 

6.4 DISCUSSION 

The a2M receptor, which is also designated CD91, was initially identified as a 
protein related to the low density lipoprotein (LDL) receptor Related Protein (LRP) 
(Strickland et al, 1990, J. Biol Chem.265: 17401-17404; Kristensen et al, 1990, FEES Lett. 

20 276: 15 1-155). The protein consists of an -420 kDa a subunit, an 85 kDa P subunit and a 39 
kDa tightly associated molecule (RAS). The a and p subunits are encoded by a single 
transcript of -15 Kb in size (Van Leuven et al, 1993, Biochun. Biophys. Acta. 1173:71-74. 
The receptor has been shown to be present in ceUs of the monocjrtic lineage and in 
hepatocytes, fibroblasts and keratinocytes. CD91 has been shown previously to bind the 

25 activated form of the plasma glycoprotein a2M, which binds to and inhibits a wide variety of 
endoproteinases. a2M receptor also binds to other ligands such as transforming growth 
factor p (O Connor-McCourt et al, 1987, J. Biol. Chem. 262:14090-14099), platelet-derived 
growth factor (Huang et al, 1984, Proc. Natl. Acad. Sci. U.S.A. 81:342-346), and fibroblast 
growth factor (Dennis et al, 1989, J. Biol. Chem. 264:7210-7216). a2M is thus believed to 

30 regulate, and specifically diminish, the activities of its various ligands. Complexed with 
these various ligands, a2M binds a2M receptor on the cell surface and is internalized through 
receptor-mediated endocytosis. Uptake of a2M-complexed ligands has been assumed thus 
far to be the primary fimction of the a2M receptor, although a role for it in lipid metaboUsm 
is also assumed. a2M receptor ligands other than a2M, such as tissue-specific plasminog^i 

35 activator-inhibitor complex (Orth et al, 1992, Proc. Natl Acad. Sci. U.S.A. 89:7422-7426) 

and urokinase-PAIl complex (Nykjaer et al, 1992, J. Biol. Chem. 267:14543-14546), have v 
been identified. These Ugands attest to a role for a2M receptor in clearing a range of 
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extracellular, plasma products. 

The studies reported here show that the heat shock proteins gp96, hsp90, hsp70, and 
calreticulin are additional ligands for the a2M receptor The hximan gp96-codiQg gene has 
been mapped previously by us at chromosome 12 (q24.2->q24.3) (Maid et al, 1993, Somatic 

5 Cell Mol. Gen. 19:73-81). It is of interest in this regard that the a2M recq)tor gene has been 
mapped to the same chromosome and at a not too distant location (ql3-^ql4) (Hilliker et al 
Genomics 13:472-474). Gp96 binds a2M receptor directly and not through other ligands 
such as a2M. Homogenous preparations of gp96, in solution, or cross-linked to a solid 
matrix, bmd to the a2M receptor. Indeed, the major ligand for the a2M receptor, a2M, 

jQ actually inhibits interaction of gp96 with a2M receptor, instead of promoting it, providing 
evidence that gp96 is a direct ligand for the aZM receptor. The 80 kDa protein shown to 
bind gp96 is clearly an amino terminal degradation product of the a subunit of the a2M 
receptor. Degradation products of the a2M receptor in this size range have also been 
observed in previous studies (Jensen et al, 1989, Biochem. Arch. 5:171-176), and may 

]^ 5 indicate the existence of a discrete ectodomain in the a2M receptor which may be 
particularly sensitive to proteolytic cleavage. 

The studies shoAvn here also indicate that the a2M receptor is also engaged by hsp90, 
hsp70 and calreticulin. This observation is surprising in light of the fact that hsp70, 
calreticulin and hsp90/gp96 have no obvious structural similarities with each other. In 

20 another context, HSPs have presented us with this dilemma before: many of the various 
HSPs have no obvious homologies with each other .and yet they appear to bind many of the 
same peptides (Ishii et al, 1999, J. Immunol. 162(3):1303-1309; Breloer etal, 1998, Eur. J. 
Immunol. 28(3):1016-1021). It remains to be seen if gipl70, which belongs to the extended 
hsp70 family and hspl 10, which has no homology to any of the other HSPs, shall share the 

25 CD91 receptor. The multiple common properties of the HSPs which share the Fourth 
Paradigm (Srivastava P.K., 1994, Experientia 50(1 1-12):1054-1060), i.e. peptide-binding, 
interacting with APCs fhrougji a common receptor, ATP-binding and ATPase activity, 
strongly suggest that these molecules must share conformational similarities which are not 
obvious from their primary sequence. Crjrstallographic analyses of the HSPs have begun to 

30 reveal their structure (Zhu et al, 1996, Science 272:16024614; Prodromou et al, 1997, Cell 
90:65-75; Stebbins et al, 1997, Cell 89:239-250), and shall shed light on this question. 

The observations that a2 macroglobulin and anti-CD91 antibodies inhibit 
re-presentation by each of the four HSPs completely, indicate that CD91 is the only receptor 
for the 4 HSPs. Considering the increasingly obvious role which the HSPs play in innate 

35 (Basu et al, 2000, Int. Immunol. 12(1 1):1539-1546) and adaptive immune response, this 
observation is somewhat counter-intuitive. However, the data on complete inhibition by two 
independent means (FIG.. 9) are quite compelling. We have noticed earlier, and we report 
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here, significant differences between hsp70 and hsp90/gp96 in their ability to compete for 
binding to gp96 receptors (Binder et al, 2000, J. Lmnunol. 165:2582-2587). Another group 
has also observed similar differences between gp96 and hsp70 (Amold-Schild et aly 1999, 
162:3757-3760). These differences are not inconsistent with our present report pointing to a 

5 single receptor for the 4 HSPs. They simply suggest that the various HSPs interact with a 
single receptor with widely differing afiSnities. Castellino et al have recently demonstrated 
re-presentation of hsp70-chq>eroned peptides by APCs through receptor-mediated uptake 
and have suggested the existence of receptors of different affinity classes for single HSPs. 
This argument is biologically cogent, but is not siq)ported by our present data. 

Once the HSP-peptide complex binds to the receptor, p^tides chaperoned by the 
HSPs must enter the APC along with the HSP. The studies reported here address the 
downstream events solely with respect to gp96 in the assumption that if all HSPs enter 
through the same portal, the downstream events must be identical or similar for peptides 
chaperoned by each of them. Our observations suggest that the peptides go from the 

J 5 endosome to the cytosol, to the ER and then to the secretory pathway to be re-presented on 
the surface. The transit through the cytosol is established through the proteasome 
requirement as well as through the TAP requirement of re-presentation. There is no known 
mechanism for transit of molecules from vesicular to soluble compartment although 
precedents certainly exist (Chiang et al, 1989, Science 246:382-385). Exploration of this 

20 pathway shall, without doubt, open a new window into our understanding of intracellular 
traffic of proteins. Castellino et al. have reported recently on the events as they occur 
downstream of receptor-mediated uptake of hsp70-peptide complexes by APCs (Castellino et 
al, 2000, supra). Our observations wifli a different HSP (gp96) are entirely consistent with 
that version of events and buttress the notion that the same portal of entry is used by all the 

25 peptide-chaperoning HSPs for re-presentation. 

As shown here, the heat shock protein-a2M receptor interaction provides a new type 
of function for a2M receptor, a fimction of a sensor, not only of the extracellular 
environment with its previously known plasma-based ligands, but also a sensor of the 
intracellular milieu as well. HSPs such as gp96 are obligate intracellular molecules and are 

30 released into the extracellular milieu only under conditions of necrotic (but not apoptotic) 
cell death. Thus, the a2M receptor may act as a sensor for necrotic cell death (see FIG. 1 1), 
just as the scavenger receptor CD36 and the recently identified phosphatidyl serine-binding 
protein act as sensors of apoptotic cell death and receptors for apoptotic cells (Savill et al, 
1992, J. Clin. Invest.90: 1513-1522; Fadok et al, 2000, Nature 405:85-90). Interaction of the 

35 macrophages with the apoptotic cells leads to a down-regulation of the inflammatory 

cytokines such as TNF (Fadok et al, 2000, supra), while gp96-APC mteraction leads to re- 
presentation of gp96-chaperoned peptides by MHC I molecules of the APC, followed by 



-86- 



wo 01/92474 



PCT/USOl/18041 



stimulation of antigen-specific T cells (Suto and Srivastava, 1995, supra) and, in addition, 
secretion of pro-inflammatory cytokines such as TNF, GM-CSF and IL-12. Interestingly, 
a2M, an independent ligand for the a2M receptor, inhibits representation of gp96- 
chaperoned peptides by macrophages. This observation suggests that re-presentation of 

5 gp96-chaperoned peptides can not occur physiologically in blood, but only within tissues as 
a result of localized necrotic cell death. This is consistent with the complete absence of gp96 
or other HSPs in blood under all conditions tested. Together, these observations point 
towards a possible mechanism whereby the release of HSPs in the blood as a result of severe 
tissue injury and lysis will not lead to a systemic and lethal pro-inflammatory cytokine 

IQ cascade. 

It is possible, therefore, that the a2M receptor renders it possible for the APCs to 
sample (i) the extracelliilar milieu of the blood through a2M and other plasma ligands and 
(ii) the intracellular milieu of the tissues through HSPs, particularly of tiie gp96 family. The 
former permits APCs to implement their primordial phagocytic function, while the latter 
15 allows them to execute its innate and adaptive immunological functions. Viewed in another 
perspective, recognition of apoptotic cells by APCs through CD36 or phophatidyl serine, 
leads to anti-inflammatory signals, while interaction of the APC with necrotic cells through 
a2M receptor leads to pro-inflammatory innate and adaptive immune responses (see 
Srivastava et aL, 1998, Immunity 8: 657-665). 

20 

The invention is not to be limited in scope by the specific embodiments described 
which are intended as single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the scope of the invention. 
25 Indeed various modifications of the invention, in addition to those shown and described 
herein will become apparent to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of the 
appended claims. 

All references cited herein, including patent applications, patents, and other 
3Q publications, are incorporated by reference herein in their entireties for aU purposes. 



35 
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WHAT IS CLAIMED IS : 

1 . A method for identifying a compound that modulates an HSP-a2M 
receptor-mediated process, comprising: 

5 (a) contacting a test compound with a heat shock protein and an alpha (2) 

macroglobulin receptor; and 

(b) measuring the level of alpha (2) macroglobulin receptor activity or 
expression, 

such that if the level of activity or eTqpression measured in (b) differs &om the level of alpha 
^0 (2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified. 

2. The method of Claim 1, in which the compound identified is an antagonist 
which interferes with the interaction of the heat shock protein with the alpha (2) 

J 5 macroglobulin receptor, further comprising the step of: 

(c) detemaining whether the level interferes with the interaction of the heat shock 
protein and the alpha (2) macroglobulin receptor. 

3. The method of Claim 1, in which the test compound is an antibody specific 
20 for the alpha (2) macroglobulin receptor, 

4. The method of Claim 1, in which the test compound is an antibody is specific 
for alpha (2) macroglobulin. 

25 5. The method of Claim 1 , in which the test compound is an antibody is specific 

for a heat shock protein. 

6. The method of Claim 1, in which the test compound is a small molecule. 

30 7. The method of Claim 1, in which the test compound is a peptide. 

8. The method of Claim 7, in which the peptide comprises at least 5 consecutive 
amino acids of the alpha (2) macroglobuhn receptor (SEQ ID NO.: 7). 

35 9. The method of Claim 7, in which the peptide comprises at least 5 consecutive 

amino acids of alpha (2) macroglobulin (SEQ ID NO.: 4). 
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10. The method of Claim 7, in which the peptide comprises at least 5 consecutive 
amino acids of a heat shock protein sequence. 

1 1 . The method of Claim 1 , in which the compound is an agonist which enhances 
5 the interaction of the heat shock protein with the alpha (2) macroglobulin receptor. 

12. The method of Claim 1 in which the HSP-a2M receptor-mediated process 
affects an autoimmune disorder, a disease or disorder involving disruption of antigen 
presentation or endocj^osis, a disease or disorder involving cytokine clearance or 
inflammation, a proliferative disorder, a viral disorder or other infectious disease, 
hypercholesterolemia, Alzheimer's disease, diabetes, or osteoporosis. 

13. A method for identifying a compound that modulates an HSP-a2M 
receptor-mediated process, comprising: 

1 5 (a) contacting a test compound with a heat shock protein and an alpha (2) 

macroglobulin receptor-expressing cell; and 
(b) measuring the level of alpha (2) macroglobulin receptor activity or expression 
in the cell, 

such that if the level of activity or expression measured in (b) differs from the level of alpha 
20 (2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified. 

14. The method of Claim 1 or 13 wherein the alpha (2) macroglobulin receptor 
activity measured is the ability to interact with a heat shock protein. 

25 

1 5 . The method of Claim 1 3 wherein the heat shock protein is non-co valently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to re-pres^t the antig^c peptide. 

30 16. The method of Claim 1 3 wherein the heat shock protein is non-covalently 

associated with an antigraic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to stimulate a cytotoxic T cell response against the antigenic peptide. 

17. A method for identifying a compound that modulates the binding of a heat 
35 shock proteia to the a2M receptor, comprising; 

(a) contacting a heat shock protein with an alpha (2) macroglobulin receptor, or 
fragment, or analog, derivative or mimetic thereof, in the presence of a test 
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(b) measuring the amount of heat shock protein bound to the alpha (2) 

macroglobulin receptor, or fragment, analog, derivative or mimetic thereof, 
such that if the amoxmt of bound heat shock protein measured in (b) differs from the amount 
5 of bound heat shock protein measured in the absence of the test compound, then a compound 
that modulates the binding of an HSP to the a2M receptor is identified. 

1 8. The method of Claim 65 wherem the solid surface is a microtiter dish. 

IQ 19. The method of Claim 17 wherein the amount of bound heat shock protein is 

measured by contacting the cell with a heat shock protein-specific antibody. 

20. The method of Claim 17 wherein the heat shock protein is labeled and the 
amount of bound heat shock protein is measured by detecting the label. 



15 



2 1 . The method of Claim 20 wherein the heat shock protein is labeled with a 
fluorescent label. 



22. A method for identifying a compound that modulates heat shock protein- 
20 mediated antigen presaitation by alpha (2) macroglobulin receptor-expressing cells 

comprising: 

(a) adding a test compound to a mixture of alpha (2) macroglobulin receptor- 
expressing cells and a complex consisting essentially of a heat shock protein 
nonco valently associated with an antigenic molecule, under conditions 

25 conducive to alpha (2) macroglobulin receptor-mediated endocytosis; 

(b) measuring the level of stimulation of antigen-specific cytotoxic T cells by the 
alpha (2) macroglobulin receptor-expressing cells, 

such that if the level measured in (b) differs from the level of said stimulation in the absence 
of the test compound, then a compound that modulates heat shock protein-mediated antigen 
30 presentation by alpha (2) macroglobulin receptor-e3q)ressing cells is identified. 

23. A method for detecting a heat shock protein-alpha (2) macroglobulin receptor- 
related disorder in a mammal comprising measuring the level of activity from an HSP-alpha 
(2) macroglobulin receptor-mediated process in a patient sample, such that if the measured 

35 level differs from the level found in clinically normal individuals, then a heat shock protein- 
alpha (2) macroglobulin receptor-related disorder is detected. 
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24. The method of Claim 23 comprising contacting a sample derived from a 
patient with an antibody specific for the alpha (2) macroglobulin receptor under conditions 
such that immimospecific binding by the antibody. 

5 25. The method of Claim 23 comprising contacting a sample derived from a 

patient with an antibody specific for a heat shock protein under conditions such that 
immunospecific binding by the antibody. 

26. The method of Claim 23 comprising contacting a sample derived &om a 
IQ patient with an antibody specific for an HSP-a2M complex under conditions such that 

immunospecific binding by the antibody. 

27. A method for modulating an immune response comprising administering to a 
mammal a purified compound that modulates the interaction of a heat shock protein with the 

^5 alpha (2) macroglobulin receptor. 

28. The method of Claim 27, in which the compoimd is an agonist which 
enhances the interaction of the heat shock protein and the alpha (2) macroglobulin receptor. 

20 29. A method for treating an autoimmune disorder comprising administering to a 

mammal in need of such treatment a purified compound that interferes with the interaction of 
a heat shock protein with the alpha (2) macroglobulin receptor. 

30. The method of Claim 29 in which the compound is an antagonist that 
25 interferes with the interaction between the heat shock protem and the a2M receptor. 

3 1 . The method of Claim 30, in which the antagonist is an antibody specific for 
alpha (2) macroglobulin receptor. 

30 32. The method of Claim 30, in which the antagonist is an antibody specific for a 

heat shock protein. 

33 . The method of Claim 30, in which the antagonist is a small molecule. 
35 34. The method of Claim 30, in which the antagonist is a peptide. 

35. The method of Claim 30, in which the peptide comprises at least 5 
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consecutive amino acids of alpha (2) macroglobulin receptor (SEQ ID NO.:l). 

36. The method of Claim 30, in which the peptide comprises at least 5 
consecutive amino acids of alpha (2) macroglobulin (SEQ ID NO.: 3). 

5 

37. The method of Claim 30» in which the peptide comprises at least 5 
consecutive amino acids of a heat shock protein sequence. 



38. A method for treating an autoimmune disorder comprising administering to a 
IQ mammal in need of such treatment a recombinant cell that expresses an alpha (2) 

macroglobulin receptor which decreases the uptake of a heat shock protein by a functional 
alpha (2) macroglobulin receptor. 

39. A method for increasing the immunopotency of a cancer cell or an infected 

J 5 cell comprising transforming said cell with a nucleic acid comprising a nucleotide sequence 
that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) macroglobulin 
receptor polypeptide. 



40. A method for increasing the immimopotency of a cancer cell or an infected 
20 cell comprising: 

(a) transforming said cell with a nucleic acid comprising a nucleotide sequence 
that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide, and 

(b) administering said cell to an individual in need of treatment, 
so as to obtain an elevated inamune response. 



41 . A recombinant cancer cell transformed with a nucleic acid comprising a 
nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide. 

30 

42. A recombinant infected cell transformed with a nucleic acid comprising a 
nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide. 

a< 43. The recombinant cell of Claim 41 or 42 which is a human cell. 



44, A kit, comprising in one or more containers: (a) an anti-a2M receptor 
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antibody or a nucleic acid probe capable of hybridizing to an a2M receptor nucleic acid, (b) a 
purified heat shock protein, nucleic acid encoding a heat shock protein, or cell expressing a 
heat shock protein; and (c) instructions for use in detecting a heat shock protein-alpha (2) 
macroglobulin recq)tor-related disorder. 

5 

45. The kit of Claim 44 wherein the antibody or nucleic acid probe is labeled with 
a detectable marker. 

46. The kit of Claim 44 further comprising a labeled macroglobulin receptor 
}Q polypeptide. 

47. A kit, in one or more containers, comprising: (a) a purified heat shock 
protein, nucleic acid encoding a heat shock proteiu, or cell expressing a heat shock protein; 
and (b) an alpha (2) macroglobxilin receptor polypeptide, nucleic acid encoding an alpha (2) 

} 5 macroglobulin receptor polypeptide, or cell expressing an alpha (2) macroglobulin receptor 
polypeptide. 

48. The kit of Claim 47 in which the alpha (2) macroglobulin receptor 
polypeptide, nucleic acid encoding an alpha (2) macroglobulin receptor polypeptide, or cell 

20 expressing an alpha (2) macroglobulin receptor polypeptide is purified. 

49. The kit of Claim 47 fijrfher comprising instructions for use in treating an 
autoimmune disorder, an infectious disease, or a proliferative disorder. 

25 50. A method for identifying an a2M receptor fiagment capable of binding a heat 

shock protein, said method comprising: 

(a) contacting a heat shock protein, or peptide-binding fiagment thereof, with one 
or more alpha (2) macroglobulin receptor firagments; and 

(b) identifying an a2M receptor fragment which specifically binds to the heat 
30 shock protein, or peptide-binding firagment thereof 

51. A method for identifying an a2M receptor fragment capable of inducing an 
HSP-a2M receptor-mediated process, said method comprising: 

(a) contacting a heat shock protein with a cell expressing a2M receptor fragment; 

35 

(b) measuring the level of alpha (2) macroglobulin receptor activity in the cell, 
such that if the level of the HSP-a2M receptor-mediated process or activity measured in (b) 
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is greater than the level of alpha (2) macroglobulin receptor activity in the absence of the 
a2M receptor fragment, then an a2M recq)tor fragment capable of inducing an HSP-a2M 
receptor-mediated process is identified. 

5 52. The method of Claim 5 1 wherein the alpha (2) macroglobulin receptor 

activity measured is the ability to interact with the heat shock protein. 

53 . The method of Claim S 1 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 

IQ measured is the ability to re-present the antigenic peptide. 

54. The method of Claim 5 1 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to stimulate a cytotoxic T cell response against the antigenic peptide. 

15 

55. A method for identifying a heat shock protein fragment capable of binding an 
a2M receptor, said method comprising: 

(a) contacting an a2M receptor with one or more heat shock protein fragments; 
and 

(b) identifying a heat shock protein fragment which specifically binds to the a2M 
receptor. 

56. A method for identifying a heat shock protein fragment capable of inducing 
an HSP-a2M receptor-mediated process, said method comprising: 

(a) contacting an a2M receptor fragment with a cell expressing a heat shock 
protein; and 

(b) measuring the level of alpha (2) macroglobulin receptor activity in the cell, 
such that if the level of the HSP-a2M receptor-mediated process or activity measured in (b) 
is greater than the level of alpha (2) macroglobulin receptor activity in the absence of said 
heat shock protein fragment, then a heat shock protein fragment capable of inducing an HSP- 
a2M receptor-mediated process is identified. 

57. The method of Claim 56 wherein the alpha (2) macroglobulin receptor 
activity measured is the ability to int^act with the heat shock protein firagment 

35 

58. The method of Claim 56 wherein the heat shock protein firagment is non- 
covalently associated with an antigenic peptide and the alpha (2) macroglobulin receptor 



25 



30 
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activity measured is the ability to re-present the antigenic peptide. 

59. The method of Claim 56 wherein the heat shock protein firagment is non- 
covalentiy associated with an antigenic peptide and the alpha (2) macroglobulin receptor 

5 activity measured is the ability to stimulate a cytotoxic T cell response against the antigenic 
peptide. 

60. A method for identifying a molecule that binds specifically to an a2M 
receptor, said method comprising: 

I Q (a) contacting an a2M receptor with one or more test molecules under conditions 

conducive to binding; and 
(b) identifying one or more test molecules that specifically bind to the a2M 
receptor. 

15 61. The method of Claim 60 wherein said test molecules are potential 

immunotherapeutic drugs. 

62. A method for screening for molecules that specifically bind to an a2M 
receptor comprising: 

20 (^) contacting an a2M receptor with one or more test molecules under conditions 

conducive to binding; and 
(b) determining whether any of said test molecules specifically bind to the a2M 
receptor. 



25 63. A method for identifying a compound that modulates the binding of an a2M 

receptor ligand to the a2M receptor comprising: 

(a) contacting an a2M receptor with an a2M receptor Ugand, or an a2M receptor- 
binding fragment, analog, derivative or mimetic thereof, in the presence of 
one or more test compounds; and 
3Q (b) measuring the amount of a2M receptor ligand, or fragment, analog, derivative 

or mimetic thereof, bound to the a2M receptor, 
such that if the amoxmt of bound a2M receptor ligand measured in (b) differs from the 
amount of bound a2M receptor measured in the absence of the test compound, then a 
compound that modulates the binding of an a2M receptor ligand to the a2M receptor is 
35 identified. 

64. The method of Claim 17 or 63, in which the alpha (2) macroglobulin receptor 
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contacted in step (a) is on a cell sur&ce. 



65. The method of Claim 17 or 63, wherein the alpha (2) macroglobulin receptor 
is immobilized to a solid surface. 

5 

66. The method of Claim 1 , 64, or 22 in which the heat shock protein is gp96. 

67. The method of Claim 1 , 64, or 22 in which the heat shock protein is hsp90. 

} Q 68. The method of Claim 1 , 64, or 22 in which the heat shock protein is hsp70. 

69. The method of Claim 1 , 64, or 22 in which the heat shock protein is 
caheticulin. 



J 5 70. A method for identifying a compound that modulates the interaction between 

the a2M receptor and an a2M receptor ligand, comprising: 

(a) contacting an a2M receptor with one or more test compounds; and 

(b) measuring the level of a2M receptor activity or expression, 

such that if the level of activity or ejqjression measured in (b) differs from the level of a2M 
20 receptor activity in the absence of one or more test compounds, then a compound that 

modulates the interaction between the a2M receptor and an a2M receptor ligand is identified. 



71 . The method of Claim 63 or 70 wherein the a2M receptor ligand is a2 
macroglobulin. 

25 

72. A method for identifying a compound that modulates antigen presentation by 
a2M receptor-expressing cells comprising: 

(a) adding one or more test compounds to a mixture of a2M receptor-expressing 
cells and a complex comprising an a2M receptor ligand and an antigenic 

30 molecule, under conditions conducive to a2M receptor-mediated endocytosis; 

(b) measuring the level of stimulation of antigen-specific cytotoxic T cells by the 
a2M receptor-expressing cells, 

such that if the level measured in (b) differs from the level of said stimulation in the absence 
of the one or more test compounds, then a compound that modulates antigen presentation by 
33 a2M receptor-expressing cells is identified. 



73. The method of Claim 22 or 72, in which the measuring stimulation of antigen- 
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specific cytotoxic T cells by the a2M receptor-expressing cells of step (b) comprises: 

(i) adding the alpha (2) macroglobulin recq)tor-expressing cells formed 
in step (a) to T cells under conditions conducive to the activation of 
the T cells; and 

5 (ii) comparing the level of activation of said cytotoxic T cells with the 

level of activation of T cells by an alpha (2) macroglobulin receptor- 
expressing cell formed in the absence of the test compound, 
wherein an increase of decrease in level of T cell activation indicates that a compound that 
modulates heat shock protein-mediated antigen presmtation by alpha (2) macroglobulin 
IQ receptor-expressing cells is identified. 

74. A method for modulating an immune response comprising administering to a 
mammal a purified conq)ound that binds to the a2M receptor, in an amount effective to 
modulate an immime response in the mammal. 

15 

75. A method for treating or preventing a disease or disorder comprising 
administering to a mammal a purified compound that binds to the a2M receptor, in an 
amount effective to treat or prevent the disease or disorder in the mammal. 

20 76. The method of Claim 75 wherein the disease or disorder is cancer or an 

infectious disease. 

77. A method for treating an autoimmune disorder comprising administering to a 
mammal in need of such treatment a purified compound that binds to the a2M receptor, in an 

25 amoimt effective to treat an autoimmime disorder in the mammal. 

78. A method for stimulating an immune response in a patient comprising 
administering to said patient blood which has been withdrawn firom said patient and treated 
to remove an a2M receptor ligand. 

30 

79. The method of Claim 78 fiirther comprising administering to said patient a 
heat shock protein or a heat shock protein-antigenic peptide complex. 

80. A method for stimulating an immime response in a patient comprising: 

35 (a) removing a a2M receptor ligand from blood withdrawn from said patient; and 

(b) returning at least a portion of the a2M receptor ligand-depleted blood to said 

patient. 
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81. 



A method for stimulating an immune response in a pat^^ "\ comprising: 



(a) withdrawing blood from said patient; 

(b) removing a a2M receptor ligand jfrom said blood; and 

(c) returning at least a portion of the a2M receptor ligand*depleted blood to said 



82. The method of Claim 8 1 further comprising after step (a) and before step (c) 
the st^ of adding a heat shock protein or a heat shock protein-antigenic peptide complex to 
said blood. 

10 

83. The method of Claims 80 or 81 wherein removing a a2M receptor ligand from 
the blood comprises the step of contacting the blood with a solid phase attached to a cx2M 
receptor ligand-binding molecule for a time period and under conditions sufficient to allow 
binding of a2M receptor ligand to the a2M receptor ligand-binding molecule solid phase. 



84. The method of Claim 83 wherein the oi2M receptor Ugand-binding molecule is 
a2M receptor, or a fragment thereof 

85. The method of Claim 83 wherein said a2M receptor Ugand-binding molecule 



86. The method of Claim 85 wherein the a2M receptor ligand-binding molecule is 
an a2M receptor Hgand-specific antibody, or a fragment thereof 

87. The method of Claims 80 or 81 wherein an apheresis system is used in said 
removing step. 

88. The method of Claim 81 wherein blood is withdrawn manually in said 
withdrawing step. 



89. The method of Claim 80 or 8 1 wherein said removing step comprises 
separating the plasma from said blood and treating said plasma to remove said a2M receptor 
ligand. 

90. The method of Claim 78 wherein said blood is administered to said patient by 



5 patient 



15 



20 does not bind a heat shock protein. 



30 



syringe. 
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91 . The method of Claim 78 wherein said blood is administered to said patient by 
an intravenous drip. 

92. Hie method of Claun 80 or 81 wherein said blood is returned to said patient 
5 by syringe. 

93. The method of Claim 80 or 8 1 wherein said blood is returned to said patient 
by an intravenous drip. 

IQ 94. A kit comprising in one or more containers a solid phase chromatography 

column with a purified a2M receptor ligand binding molecule attached thereto, such that 
withdrawn blood can be run over the column to deplete the blood of a a2M receptor ligand. 

95. The kit of Claim 94 wherein the a2M receptor ligand binding molecule does 
J5 not bind heat shock proteins. 

96. The method of Claim 78, 80, or 81 wherein the a2M receptor ligand is a2M, a 
lipoprotein complex, lactoferrin, tissue-type plasminogen activator, urokinase-type 
plasminogen activator, or an exotoxin 

20 



25 



30 



35 
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1. Specific binding of HSPs and a, -macroglobulin to primaiy cultures and 
tdl Unes of several histological origins* 
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^S^SSS ?f CTGAGGAGGC GGAAACGGGG GAGCCCCTAG TGCTCCATCA 60 

SSSStcS W??SSg? c?SS^^^ CCCOCCACCC aCCTCCTCC ig 

cfrrrrr?^ llllln^^ TCC6AGATGG GGCTGTGAGC TTCGCCCTGG 180 

6TAAAGCAGG 6GTGAAGGGT TCGAATTTGG GGGCAGGGGG 240 

CGCACCCGCG TCAGCAGGCC CTTCCCAGGG GGCTCGGAAC TGTACCATTT CACCTATGCC 300 

rS?rSr^ lllr^rl^^i GAAGGATAAG ATAGAAgJJ? JgSgSJgG S^SSSgG ^SJ 

^rwS^^^ ^GCGAGGACA AGAAGTAACA GGACCAGAGG GTGGGGGCTG 420 

CTGTTTGCAT CGGCOCACAC C ATG CTG ACC CCG CCG TTG CTG CTG CTC GTG 471 

Met Leu Thr Pro Pro Leu Leu Leu Leu Val 
.1 5 10 

CCG CTG CTT TCA GCT CTG GTC TCC GGG GCC ACT ATG GAT GCC OCT AAA 519 
Pro Leu Leu Ser Ala Leu Val Ser Gly Ala Thr Met Asp Ala Pro Lys 
15 20 25 

ACT TGC AGC CCT AAG CAG TTT GCC TGC AGA GAC CAA -ATC ACC TGT ATC 567 
Thr Cys Ser Pro Lys Gin Phe Ala Cys Arg Asp Gin He Thr Cys He 
30 35 . * 40 

TCA AAG GGC TGG CGG TGT GAC GGT GAA AGA GAT TGC CCC GAC GGC TCT 615 
Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg Asp Cys Pro Asp Gly Ser 
45 50 55 

GAT GAA GCC OCT GAG ATC TGT CCA CAG AGT AAA GCC CAG AGA TGC CCG 663 
Asp Glu Ala Pro Glu He Cys Pro Gin Ser Lys Ala Gin Arg Cys Pro 
60 65 70 

CCA AAT GAG CAC AGT TGT CTG GGG ACT GAG CTA TGT GTC CCC ATG TCT 711 

Pro Asn Glu His Ser Cys Leu Gly Thr Glu Leu Cys Val Pro Met Ser 

■'S SO . 85 90 , 

CGT CTC TGC AAC GGG ATC CAG GAC TGC ATG GAT GGC TCA GAC GAG GGT 759 
Arg Leu Cys Asn Gly He Gin Asp Cys Met Asp Gly Ser Asp Glu Gly 
95 100 105 

GCT CAC TGC CGA GAG CTC CGA GCC AAC TGT TCT CGA ATG GGT TGT CAA 807 
Ala His Cys Arg Glu Leu Arg Ala Asn Cys Ser Arg Met Gly Cys Gin 
110 lis ^ 120 

CAC CAT TGT GTA CCT ACA CCC AGT GGG CCC ACG TGC TAG TGT AAC AGC 855 
His His Cys Val Pro Thr Pro Ser Gly Pro Thr Cys Tyr Cys Asn Ser- 
125 130 135 

AGC TTC CAG CTC GAG GCA GAT GGC AAG ACG TGC AAA GAT TTT GAC GAG 903 
Ser Phe Gin Leu Glu Ala Asp Gly Lys Thr Cys Lys Asp Phe Asp Glu 
140 145 150 

TGT TCC GTG TAT GGC ACC TGC AGC CAG CTT TGC ACC AAC ACA GAT GGC 95'i 
Cys Ser Val Tyr Gly Thr Cys Ser Gin Leu Cys Thr Asn Thr Asp Gly 
155 160 165 170 

TCC TTC ACA TGT GGC TGT GTT GAA GGC TAC CTG CTG CAA CCG GAC AAC 999 
Ser Phe Thr Cys Gly Cys Val Glu Gly Tyr Leu Leu Gin Pro Asp Asn 
175 180 . 185 

CGC TCC TGC AAG GCC AAG AAT GAG CCA GTA GAT CGG CCG CCA GTG CTA 1047 
Arg Ser Cys Lys Ala Lys Asn Glu Pro Val Asp Arg Pro Pro Val Leu 
190 195 200 
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CTG ATT 6CC AAC TCT CAG AAC ATC CTA GCT AOG TAG CTG AGT GGG GCC 1095* 
Leu He Ala Asn Ser Gin Asi\ He Leu Ala Thr Tyr Leu Set Gly Ala 
205 210 215 

CAA GTG TCT ACC ATC ACA CCC ACC AGC ACC CGA CAA ACC ACG GCC ATG 1143 
Gin Val Ser Thr He Thr Pro Thr Ser Thr Arg Gin Thr Thr Ala Met 

220 225 • . 230 

GAC TTC AGT TAT GCC AAT GAG ACC GTA TGC TGG GTG CAC GTT GGG GAC llSl' 
Asp Phe Ser Tyr Ala Asn Glu Thr Val Cys Trp Val His Val Gly Asp 
235 240 • 245 250 

AGT 6CT GCC CAG ACA CAG CTC AAG TGT GCC CGG ATG CCT GGC CTG AA6 123S 
Ser Ala Ala Gin Thr Gin Leu Lys Cys Ala Arg Met Pro Gly Leu Lys 
255 260 265 

GGC TTT GTG GAT GAG CAT ACC ATC AAC ATC TCC CTC AGC CTG CAC CAC 1287 
Gly Phe Val Asp Glu His Thr He Asn He Ser Leu Ser Leu His His 
270 . . s - 275 280 

GTG. GAG CAG ATG GCA ATC GAC TGG CTG ACG GGA AAC TTC TAC TTT GTC 1335 
Val Glu Gin Met Ala He Asp Trp Leu Thr Gly Asn Phe Tyr Phe Val 
285 290 * 295 

GAC GAC ATT GAC GAC AGG ATC TTT GTC TGT AAC CGA AAC GGG GAC ACC 1383 
Asp Asp He Asp Asp Arg He Phe Val Cys Asn Arg Asn Gly Asp Thr 
300 305 310 

TGT GTC ACT CTG CTG GAC CTG GAA CTC TAC AAC CCC AAA GGC ATC GCC 1431 
Cys Val Thr Leu Leu Asp Leu Glu Leu Tyr Asn -Pro Lys Gly He Ala 
315 320 325 330 

TTG GAC CCC GCC ATG GGG AAG GTG TTC TTC ACT GAC TAC GGG CAG ATC 1479 
Leu Asp Pro Ala Met Gly Lys Val Phe Phe Thr Asp Tyr Gly Gin He 
335 340 345 

CCA AAG GTG GAG CGC TGT GAC ATG GAT GGA CAG AAC CGC ACC AAG CTG 1527 ' 
Pro Lys Val Glu Arg Cys Asp Met Asp Gly Gin Asn Arg Thr Lys Leu 
350 355 360 



GTG GAT AGC AAG ATC GTG TTT CCA CAC GGC ATC ACC CTG GRC CTG GTC 1575 
Val Asp Ser Lys He Val Phe Pro His Gly He Thr Leu Asp Leu Val * 
365 370 375 

AGC CGC CTC GTC TAC TGG GCG GAC GCC TAC CTA GAC TAC ATC GAG GTG 1623 
Ser Arg Leu Val Tyr Trp Ala Asp. Ala Tyr Leu Asp Tyr He Glu Val 
380 385 390 

GTA GAC TAC GAA GGG AAG GGT CGG CAG ACC ATC ATC CAA GGC ATC CTG 1671 
Val Asp Tyr Glu Gly Lys Gly Arg Gin Thr He He Gin Gly He Leu 
395 400 405 410 

ATC GAG CAC CTG TAC GGC CTG ACC GTG TTT GAG AAC TAT CTC TAC GCC 1719 
He Glu His Leu Tyr Gly Leu Thr Val Phe Glu Asn Tyr Leu Tyr Ala 
415 420 425 

ACC AAC TCG GAC AAT GCC AAC ACG CAG CAG AAG ACG AGC GTG ATC CGA 1767 
Thr Asn Ser Asp Asn Ala Asn Thr Gin Gin Lys Thr Ser Val He Arg 
430 435 440 
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GTG AAC CGG TTC AAC AGT ACT GAG TAG CAG 6TC GTC ACC CGT GTG GAC XBlS 
Val Asn Arg Phe Asn Ser Thr Glu Tyr Gin Val Val Thr Arg Val Asp 
445 450 455 

AAG GGT GGT GCC CTG CAT ATC TAC CAC CAG CGA CGC CAG CCC CGA GTG 1863 
hys Gly Gly Ala Leu His lie Tyr His Gin Arg Arg Gin Pro Arg Val 
460 465 470 

CGG AGT CAC GCC TGT GAG AAT GAC CAG TAC GGG AAG CCA GGT GGC TGC ISll' 
Arg Ser His Ala Cys Glu Asn Asp Gin Tyr Gly Lys Pro Gly Gly Cya 
475 480 • 485 490 

TCC GAC ATC TGC CTC CTG GCC AAC AGT CAC AAG GCA AGG ACC TGC AGG 1959 
Ser Asp He Cys Leu Leu Ala Asn Ser His Lys Ala Arg Thr Cys Arg 
495 500 505 

TGC AGG TCT GGC TTC AGC CTG GGA AGT GAT GGG AAG TCT TGT AAG AAA 2007 
Cys Arg Ser Gly Phe Ser Leu Gly Ser Asp Gly Lys Ser Cys Lys Lys 
510 515 520 

CCT GAA CAT GAG CTG TTC CTC GTG TAT GGC AAG GGC CGA CCA GGC ATC 2055 
Pro Glu His Glu Leu Phe Leu Val Tyr Gly Lys Gly Arg Pro Gly lie 
525 530 535 

ATT AGA GGC ATG GAC ATG GGG GCC AAG GTC CCA GAT GAG CAC ATG ATC 2103 
He Arg Gly Met Asp Met Gly Ala Lys Val Pro Asp Glu His Met He 
540 545 550 

Ccic ATC GAG AAC CTT ATG AAT CCA CGC GCT CTG GAC TTC CAC GCC GAG 2151 

Pro He Glu Asn Leu Met Asn Pro Arg Ala Leu Asp Phe His Ala Glu 

555 560 565 • 570 - 

ACC GGC TTC ATC TAC TTT GCT GAC ACC ACC AGC TAC CTC ATT GGC CGC 2199 
Thr Gly Phe He Tyr Phe Ala Asp Thr Thr Ser Tyr Leu He Gly Arg 
575 580 585 

CAG AAA ATT GAT GGC ACG .GAG AGA GAG ACT ATC CTG AAG GAT GGC ATC 2247 
GXn Lys He Asp Gly Thr Glu Arg Glu Thr He Leu Lys Asp Gly He 
590 .595 600 

CAC AAT GTG GAG GGC GTA GCC GTG GAC TGG ATG GGA GAC AAT CTT TAC 2295 
His Asn Val Glu Gly Val A^a Val Asp Trp Met Gly Asp Asn Leu Tyr • 
605 610 615 

TGG ACT GAT GAT GGC CCC AAG AAG ACC ATT AGT GTG GCC AGG CTG GAG 2343 
Trp Thr Asp Asp Gly Pro Lys Lys Thr He Ser Val Ala Arg Leu Glu 
620 625 630 

AAA GCC GCT CAG ACC CGG AAG ACT CTA ATT GAG GGC AAG ATG ACA CAC 2391 
Lys Ala Ala Gin Thr Arg Lys Thr Leu He Glu Gly Lys Met Thr His 
635 640 645 650 

CCC AGG GCC ATT GTA GTG GAT CCA CTC AAT GGG TGG ATG TAC TGG ACA 2.439 
Pro Arg Ala He Val Val Asp Pro Leu Asn Gly Trp Met Tyr Trp Thr 
655 660 665 

GAC TGG GAG. GAG GAC CCC AAG GAC AGT CGG CGA GGG CGG CTC GAG AGG 24*87 
Asp Trp Glu Glu Asp Pro Lys Asp Ser Arg Arg Gly Arg Leu Glu* Arg 
670 675 680 
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GCT TGG ATG GAC GGC TCA CAC CGA GAT ATC TTT GTC ACC TCC AAG ACA 2535 
Ala Trp Met Asp Gly Ser His Arg Asp He Phe Val Thr Ser hys Thr 
685 690 695 

6TG CTT TGG CCC AAT GGG CTA AGC CTG GAT ATC CCA 6CC GGA CGC CTC 2583 
Val I/eu Trp Pro Asn Gly Leu Ser Leu Asp He Pro Ala Gly Arg Leu 
700 ... 705 110. 

TAG TGG GTG GAT GCC TTC TAT GAC CGA ATT GAG ACC ATA CTG CTC AAT 2631 
Tyr Trp Val Asp Ala Phe Tyr Asp Arg He Glu Thr He Leu Leu Asn 
715 720 725 730 

GGC ACA GAC CGG AAG ATT GTA TAT GAG GGT CCT GAA CTG AAT CAT GCC 2679 
Gly Thr Asp Arg Lys He Val Tyr Glu Gly Pro Glu Leu Asn Kts Ala 
735 740 745 

TTC GGC CTG TGT CAC CAT GGC AAC TAG CTC TTT TGG ACC GAG* TAC CGG 2727 
Phe Gly Leu Cys His His Gly Asn Tyr Leu Phe Trp Thr Glu Tyr Arg 
750 755 760 

AGC GGC AGC GTC TAC CGC TTG GAA CGG GGC GTG GCA GGC GCA CCG CCC 2775 
Ser Gly Ser Val Tyr Arg Leu Glu Arg Gly Val Ala Gly Ala Pro Pro 
765 770 775 

ACT GTG ACC CTT CTG CGC AGC GAG AGA CCG CCT ATC TTT GAG ATC CGA 2823 
Thr Val Thr Leu Leu Arg Ser Glu Arg Pro Pro He Phe Glu He Arg 
780 785 790 

ATG TAC GAC GCG CAC GAG CAG CAA GTG GGT ACC AAC AAA TGC CGG GTA 2871 
Met Tyr Asp Ala His Glu Gin Gin Val Gly Thr Asn Lys Cys Arg Val 
795 800 805 810 

AAT AAC GGA GGC TGC AGC AGC CTG TGC CTC GCC ACC CCC GGG AGC CGC. 2919 
Asn Asn Gly Gly Cys Ser Ser Leu Cys Leu Ala Thr Pro Gly Ser Arg 
615 820 825 

CAG TGT GCC TGT GCC GAG GAC CAG GTG TTG GAC ACA GAT GGT GTC ACC 2967 
Gin Cys Ala Cys Ala Glu Asp Gin Val Leu Asp Thr Asp Gly Val Thr 
830 835 840 

TGC TTG GCG AAC CCA TCC TAC GTG CCC CCA CCC CAG TGC CAG CCG GGC 3015 
cys Leu Ala Asn Pro Ser Tyr Val Pro Pro Pro Gin Cys Gin Pro Gly" 
845 850 85S 

CAG TTT GCC TGT GCC AAC AAC CGC TGC ATC CAG GAG CGC TGG AAG TGT 3063 
Gin Phe Ala Cys Ala Asn Asn Arg Cys He Gin Glu Arg Trp Lys Cys 
860 865 870 

GAC GGA GAC AAC GAC TGT CTG GAC AAC AGC GAT GAG GCC CCA GCA CTG 3111 
Asp Gly Asp Asn Asp Cys -Leu Asp Asn Ser Asp Glu Ala Pro Ala Leu 
875 880 885 B90 

TGC CAT CAA CAC ACC TGT CCC TCG GAC CGA TTC AAG TGT GAG AAC AAC 3159 
Cys His Gin His Thr Cys Pro Ser Asp Arg Phe Lys Cys Glu Asn Asn 
895 900 905 

CGG TGT ATC CCC AAC CGC TGG CTC TGT GAT GGG GAT AAT GAT TGT GGC 3207 
Arg Cys He Pro Asn Arg Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly 
910 915 920 
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hhC hoc GAG GAC GAA TCC AM GCC ACG TGC TOV GCC CGC ACC TGT CCA 3255 
Asn Ser Glu Asp GIu Sex Asn Ala Thr Cys Ser Ala Arg Thr Cys Pro 
925 930 9^35 

GCC AAC CAG TTC TCC TGT GCC AGT GGC C6A TGC ATT CCT ATC TCA TGG 3303 
Pro Asn Gin Phe Ser Cys Ala Ser Gly.Arg Cys lie Pro lie Ser Trp 
940 945 950 

ACC TGT GAT CTG GAT GAT GAC TGT GGG GAC CGG TCC GAT GAG TCA GCC i3351 
Thr Cys Asp Leu Asp Asp Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala 
955 960 965 970 

TCA TGC GCC TAC CCC ACC TGC TTC CCC CTG ACT CAA TXT ACC TGC AAC 3399 . 
Ser Cys Ala Tyr Pro Thr Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn* 
975 980 985 

AAT GGC AG A TGT ATT AAC ATC AAC TGG CGG TGT GAC AAC GAC AAT GAC 3447 
Asn Gly Arg Cys He Asn He Asn Trp Arg Cys Asp Asn Asp Asn Asp 
990 995 1000 

TGT GGG GAC AAC AGC GAC GAA GCC GGC TGC AGT CAC TCC TGC TCC AGT 3495 
Cys Gly Asp Asn Ser Asp Glu Ala Gly Cys Ser His Ser Cys Ser Ser 
1005 1010 1015 

ACC CAG TTC AAG TGC AAC AGT GGC AGA TGC ATC CCC GAG CAC TGG ACG 3543 
Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys He Pro Glu His Trp Thr 
1020 1025 1030 

TGT GAT GGG GAC AAT GAT TGT GGG GAC TAC AGC GAC GAG ACA CAC GCC 3591 
Cys Asp Gly Asp Asn Asp Cys isly Asp Tyr Ser Asp Glu Thr . His Ala 
1035 1040. 1045 1050 

AAC TGT ACC AAC CAG GCT ACA AGA CCT CCT GGT GGC TGC CAC TCG GAT 3639 
Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro Gly Gly Cys His Ser Asp 
1055 1060 1065 

GAG TTC CAG TGC CCG CTA GAT GGC CTG TGC ATC CCC CTG AGG TGG CGC 3687 
Glu Phe Gin Cys Pro Leu Asp Gly Leu Cys He Pro Leu Arg Trp Arg 
1070 1075 1080 

TGC GAC GGG GAC ACC GAC TGC ATG GAT TCC AGC GAT GAG AAG AGC TGT 3735 
Cys Asp Gly Asp Thr Asp Cys Met Asp Ser Ser Asp Glu Lys Ser Cys 
1085 1090 1095 

GAG GGC GTG ACC CAT GTT TGT GAC CCG AAT GTC AAG TTT GGC TGC AAG 3783 
Glu Gly Val Thr His Val Cys Asp Pro Asn Val Lys Phe Gly. Cys Lys 
1100 1105 1110 

GAC TCC GCC CGG TGC ATC AGC AAG GCG TGG GTG TGT GAT GGC GAC AGC 3831 
Asp Ser Ala Arg Cys He Ser Lys Ala Trp Val Cys Asp Gly Asp Ser 
1115 1120 1125 1130 

GAC TGT GAA GAT AAC TCC GAC GAG GAG AAC TGT GAG GCC CTG GCC TGC 3879 
Asp Cys Glu Asp Asn Ser Asp Gl^ Glu Asn Cys Glu Ala Leu Ala Cys 
1135 1140 1145 

AGG CCA CCC TCC CAT CCC TGC GCC AAC AAC ACC TCT GTC TGC CTG CCT 3927 
Arg Pro Pro Ser His Pro Cys Ala Asn Asn Thr Ser Val Cys Leu Pro 
1150 1155 1160 
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CCT GAC MG CTG T6C GAC GGC AAG GAT GAC TGT GGA GAC GGC TCG GAT 3975 
Pro Asp Lys Leu Cys Asp Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp • 
1165 1170 ii75 

GAG GGC GAG CTC TGT GAC CAG TGT TCT CTG AAT AAT GGT GGC TGT AGT 4023 
Glu Gly Glu Leu Cys Asp Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser 
1180. 1185 1190 . ' 

CAC AAC TGC TCA GTG GCC CCT GGT GAA GGC ATC GTG TGC TCT TGC CCT 4071 
His Asn Cys Ser Val Ala Pro Gly Glu Gly lie Val Cys Ser Cys Pro 
1195 1200. 1205 1210 

CTG GGC ATG GAG CTG GGC TCT GAC AAC CAC ACC TGC CAG ATC CAG AGC 41tt 
Leu Gly Met Glu Leu Gly Ser Asp Asn His Thr Cys Gin lie Gin Ser 
1215 1220 1225 

TAG TGT GCC AAG CAC CTC AAA TGC AGC CAG AAG TGT GAC CAG AAC AAG 4167 
Tyr Cys Ala Lys His Leu Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys 
1230 1235 1240 

TTC AGT GTG AAG TGC TCC TGC TAC GAG GGC TGG GTC TTG GAG CCT GAC 4215 
Phe Ser Val Lys Cys Ser Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp 
1245 1250 1255 

GGG GAA ACG TGC CGC AGT CTG GAT CCC TTC AAA CTG TTC ATC ATC TTC 4 263 
Gly Glu Thr Cys Arg Ser Leu Asp Pro Phe Lys Leu Phe lie lie Phe 
1260 1265 1270 

TCC AAC CGC CAC GAG ATC AGG CGC ATT GAC CTT CAC AAG GGG GAC TAC 4311 
Ser Asn Arg His Glu lie Arg Arg lie Asp Leu His Lys Gly Asp Tyr 
1275 1280 1285 1290 

AGC GTC CTA GTG CCT GGC CTG CGC AAC ACT ATT GCC CTG GAC TTC CAC 4359 
Ser Val Leu Val Pro Gly Leu Arg Asn Thr lie Ala Leu Asp Phe His 
1295 1300 1305 

CTC AGC CAG AGT GCC CTC TAC TGG ACC GAC GCG GTA GAG GAC AAG ATC 4407 
Leu Ser Gin Ser Ala Leu Tyr Trp Thr Asp Ala Val Glu Asp Lys lie 
1310 1315 1320 

TAC CGT GGG AAA CTC CTG GAC AAC GGA GCC CTG ACC AGC TTT GAG GTG 4455 
Tyr Arg Gly Lys Leu Leu Asp Asn Gly Ala Leu Thr Ser Phe Glu Val" 
1325 1330 1335 

GTG ATT CAG TAT GGC TTG GCC ACA CCA GAG GGC CTG GCT GTA GAT TGG 4503 
Val ile Gin Tyr Gly Leu Ala Thr Pro Glu Gly Leu Ala Val Asp Trp 
1340 .. 1345 1350 • 

ATT GCA GGC AAC ATC TAC TGG GTG GAG AGC AAC CTG GAC CAG ATC GAA 4551 
lie Ala Gly Asn lie Tyr Trp Val Glu Ser Asn tea Asp Gin lie Glu 
1355 1360 1365 1370 

GTG GCC AAG CTG GAC GGA ACC CTC CGA ACC ACT CTG CTG GOG GGT GAC 4599 
Val Ala Lys Leu Asp Gly Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp 
1375 1380 1385 

ATT GAG CAC CCG AGG GCC ATC GCT CTG GAC CCT CGG GAT GGG ATT CTG 4647 
lie Glu His Pro Arg Ala lie Ala Leu Asp Pro Arg Asp Gly lie Leu 
1390 1395 1400 
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TTT TGG ACA GAC TGG GAT GCC AGC CT6 CCA CGA ATC GAG 6CX "GCA TCC 4695 
Phe Trp Thr Asp Trp Asp Ala Ser Leu Pro Arg lie Glu Ala Ala Ser 
1405 1410 1415 

ATG AGT GGA GCT GGC CGC CGA ACC ATC CAC CGG GAG ACA 66C TCT GGG 4743 
Met Ser Gly Ala Gly Arg Arg Thr He His Arg Glu Thr Gly Ser Gly 
1420 1425 1430 • ' ' 



GGC TGC GCC AAT GGG CTC ACC GTG GAT TAC CTG GAG AAG CGC ATC CTC 
Gly Cys Ala Asn Gly Leu Thr Val Asp Tyr Leu Glu Lys Arg He Leu 
1435 1440 1445 1450 



47S1 



TGG ATT GAT GCT AGG TCA GAT GCC ATC TAT TCA GCC CGG TAT GAC GGC 
Trp He Asp Ala Arg Ser Asp Ala He Tyr Ser Ala Arg Tyr Asp Gly 
1455 1460 • 1465 



4839 



TCC GGC CAC ATG GAG GTG CTT CGG GGA CAC GAG TTC CTG TCA CAC CCA 
Ser Gly His Met Glu Val Leu Arg Gly His Glu Phe Leu Ser His Pro 
1470 1475 1480 



4887 



TTT GCC GTG ACA CTG TAC GGT GGG GAG GTG TAC TGG ACC GAC TGG CGA 
Phe Ala Val Thr Leu Tyr Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg 
1485 1490 1495 



4935 



ACA AAT ACA CTG GCT AAG GCC AAC AAG TGG ACT GGC CAC AAC GTC ACC 
Thr Asn Thr Leu Ala Lys Ala Asn Lys Trp Thr Gly His Asn Val Thr 
1500 1505 1510 



4983 



GTG GTA CAG AGG ACC AAC ACC CAG CCC TTC GAC CTG CA6 GTG TAT CAC 
Val Val Gin Arg Thr Asn Thr Gin Pro Phe Asp Leu Gin Val Tyr His 
1515 1520 1525 1530 



5031 



CCT TCC CGG CAG CCC ATG GCT CCA AAC CCA TGT GAG GCC AAT GGC GGC 
Pro Ser Arg Gin Pro Met Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly 
1535 1540 1545 



5079 



CGG GGC CCC TGT TCC CAT CTG TGC CTC ATC AAC TAC AAC CGG ACC GTC 
Arg Gly Pro Cys Ser His Leu Cys Leu He Asn Tyr Asn Arg Thr Val 
1550 1555 1560 



5127 



TCC TGG GCC TGT CCC CAC CTC ATG AAG CTG CAC AAG GAC AAC ACC ACC 5175 
Ser Trp Ala Cys Pro His Leu Met Lys Leu His Lys Asp Asn Thr Thr 
1565 1570 1575 

TGC TAT GAG TTT AAG AAG TTC CTG CTG TAC GCA CGT CAG ATG GAG ATC 5223 
Cys Tyr Glu Phe Lys Lys Phe Leu Leu Tyr Ala Arg Gin Met Glu He 
1580 1595 1590 

CGG GGC GTG GAC CTG GAT GCC CCG TAC TAC AAT TAT ATC ATC TCC TTC 5271 
Arg Gly Val Asp Leu Asp Ala Pro Tyr Tyr Asn Tyr He He Ser Phe 
1595 .1600 1605 1610 

ACG GTG CCT GAT ATC GAC AAT GTC ACG GTG CTG GAC TAT GAT GCC CGA . 5319 
Thr Val Pro Asp He Asp Asn Val Thr Val Leu Asp Tyr- Asp Ala Arg 
1615 1620 1625 

GAG CAG CGA GTT TAC TGG TCT GAT GTG CGG ACT CAA GCC ATC AAA AGG 5367 
Glu Gin Arg Val Tyr Trp Ser Asp Val Arg Thr Gin Ala He Lys Arg 
1630 1635 1640 
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GCA TXT ATC AAC 6GC ACT GGC GTG GAG ACC GTT GTC TCT GCA GAC TTG 
Ala Phe He Asn Gly Thr Gly Val Glu Thr Val Val Ser Ala Asp Leu 
1645 1650 1655 



5415 



CCC AAC GCC CAC GGG CTG 6CT GTG GAC TGG GTC TCC C6A AAT CTG TTT 
Pro Asn Ala His Gly* Leu Ala Val Asp Trp Val Ser Arg Asn Leu Phe 
1660 1665 1670 



5463 



TGG ACA AGT TAG GAC ACC AAC AAG AAG CAG ATT AAC GTG GCC CGG CTG 5511 
Trp Thr Ser Tyr Asp Thr Asn Lys Lys Gin He Asn Val Ala Arg Leu 
1675 1680 • 1685 1690 

GAC GGC TCC TTC AAG AAT GCG GTG GTG CAG GGC CTG GAG CAG CCC CAC 5559* 
Asp Gly Ser Phe Lys Asn Ala Val Val Gin Gly Leu Glu Gin Pro His 
1695 1700 1705 



GGC CTG GTC GTC CAC CCG CTT CGT GGC AAG CTC TAC TGG ACT GAT GGG 
Gly Leu Val Val His Pro Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly 
1710 1715 1720 



5607 



GAC AAC ATC AGC ATG GCC AAC ATG GAT GGG AGC AAC CAC ACT CTG CTC 
Asp Asn He Ser Met Ala Asn Met Asp Gly Ser Asn His Thr Leu Leu 
1725 1730 1735 



5655 



TTC AGT GGC CAG AAG GGC CCT GTG GGG TTG GCC ATT GAC TTC CCT GAG 
Phe Ser Gly Gin Lys Gly Pro Val Gly Leu Ala He Asp Phe Pro Glu 
1740 1745 1750 



5703 



AGC AAA CTC TAC TG6 ATC AGC TCT GGG AAC CAC ACA ATC AAC CGT TGC 
Ser Lys Leu Tyr Trp He Ser Ser Gly Asn His Thr lie Asn Arg Cys 
1755 1760 1765 1770 



5751 



AAT CTG GAT GGG AGC GAG CTG GAG GTC ATC GAC ACC ATG CGG AGC CAG 
Asn Leu Asp Gly Ser Glu Leu Glu Val He Asp Thr Met Arg Ser Gin 
1775 i780 1785 



5799 



CTG GGC AAG GCC ACT GCC CTG GCC ATC ATG GGG GAC AAG CTG TGG TGG 
Leu Gly Lys Ala Thr Ala Leu Ala He Met Gly Asp Lys Leu Trp Trp 
1790 1795 1800 



5847 



GCA GAT CAG GTG TCA GAG AAG ATG GGC ACG TGC AAC AAA GCC GAT GGC 
Ala Asp Gin Val Ser Glu Lys Met Gly Thr Cys Asn Lys Ala Asp Gly 
1805 1810 1815 



5B95 



TCT GGG TCC GTG GTG CTG CGG AAC AGT ACC ACG TTG GTT ATG CAC ATG 
Ser Gly Ser Val Val Leu Arg Asn Ser Thr Thr Leu Val Met His Met 
1820 1825 . . 1830 



5943 



AAG GTG TAT GAC GAG AGC ATC CAG CTA GAG CAT GAG GGC ACC AAC CCC 5991 
Lys Val Tyr Asp Glu Ser He Gin Leu Glu His Glu Gly Thr Asn Pro 
1835 1840 1845 1850 

TGC AGT GTC AAC AAC GGA GAC TGT TCC CAG CTC TGC CTG OCA ACA TCA 6039 
Cys Ser Val Asn Asn Gly Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser 
1855 1860 1865 



GAG ACG ACT CGC TCC TGT ATG TGT ACA GCC GGT TAC AGC CTC CGG AGC 
Glu Thr Thr Arg Ser Cys Met Cys Thr Ala Gly Tyr Ser Leu Arg. Ser 
1870 1875 1880 



6087 
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GGA CAG CAG GCC TGT GAG GGT GTG GGC TCT TTT CTC CTG TAG TCT GTA 6135 
Gly Gin Gin Ala Cys Glu Gly Val Gly Ser Phe Leu Leu Tyr Ser Val 
1885 1890 1895 

CAT GAG GGA ATT CGG GGG ATT CCA CTA GAT CCC AAT GAC AAG TCG GAT 6183 
His Glu -Gly He Arg Gly He Pro Leu Asp Pro Asn Asp hya Ser Asp 
1900 1905 1910 

GCC CTG GTC CCA GTG TCC GGA ACT TCA CTG GCT GTC GGA ATC GAC TTC 6231 
Ala Leu Val Pro Val Ser Gly Thr Ser Leu Ala Val Gly He Asp Phe 
1915 1920 1925 1930 

CAT GCC GAA AAT GAC ACT ATT TAT TGG GTG GAT ATG GGC CTA AGC ACC 6279 
His Ala Glu Asn Asp Thr He Tyr Trp Val Asp Met Gly Leu Ser Thr 
1935 1940 1945 

ATC AGC AGG GCC AAG CGT GAC CAG ACA TGG CGA GAG GAT GTG GTG ACC 6327 
He Ser Arg Ala Lys Arg Asp Gin Thr Trp Arg Glu Asp Val Val Thr 
1950 1955 1960 

AAC GGT ATT GGC CGT GTG GAG GGC ATC GCC GTG GAC TGG ATC CCA GGC 6375 
Asn Gly He Gly Arg Val Glu Gly He Ala Val Asp Trp He Ala Gly 
1965 1970 1975 

AAC ATA TAC TGG ACG GAC CAG GGC TTC GAT GTC ATC " GAG GTT GCC CGG 6423 
Asn He Tyr Trp Thr Asp Gin Gly Phe Asp Val He Glu Val Ala Arg 
1980 1985 1990 

CTC AAT GGC TCT TTT CGT TAT GTG GTC ATT TCC CAG GGT CTG GAC AAG 6471 
Leu Asn Gly Ser Phe Arg Tyr Val Val He Ser Gin Gly Leu Asp Lys 
1995 2000 2005 2010 

CCT CGG GCC ATC ACT GTC CAC CCA GAG AAG GGG TAC TTG TTC TGG ACC 6519 
Pro Arg Ala He Thr Val His Pro Glu Lys Gly Tyr Leu Phe Trp Thr 
2015 2020 2025 

GAG TGG GGT CAT TAC CCA CGT ATT GAG CGG TCT CGC CTT GAT GGC ACA 6567 
Glu Trp Gly His Tyr Pro Arg He Glu Arg Ser Arg Leu Asp Gly Thr 

2030 2035 2040 , 

GAG AGA GTG GTG TTG GTT AAT GTC AGC ATC AGC TGG CCC AAT GGC ATC 6615 
Glu Arg Val Val Leu Val Asn Val Ser He Ser Trp Pro Asn Gly He 
2045 2050 2055 

TCA GTA GAC TAT CAG GGC GGC AAG CTC TAC TGG TGT GAT GCT CG(3 ATG 6663 
Ser Val Asp Tyr Gin Gly Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met 
2060 2065 2070 

GAC AAG ATC GAG CGC ATC GAC CTG GAA ACG GGC GAG AAC CGG GAG GTG 6711 
Asp Lys lie Glu Arg He Asp Leu Glu Thr Gly Glu Asn Arg Glu Val 
207S 2080 2085 2090 

GTC CTG TCC AGC AAT AAC ATG GAT ATG TTC TCC GTG TCC GTG TTT GAG 6759 
Val Leu Ser Ser Asn Asn Met Asp Met Phe Ser Val Ser yal Phe Glu 
2095 2100 2105 

GAC TTC ATC TAc' TGG AGT GAC AGA ACT CAC GCC AAT GGC TCC ATC AAG 6807 
Asp Phe He Tyr Trp. Ser Asp Arg Thr His Ala Asn Gly Ser He Lys 
^ 2110 2115 2120 
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S« l?!^ ^ GTG CCT CTG AGG ACA GGC 

Arg Gly Cys Lys Asp Asn Ala Thr Asp Ser Val Pro Leu Arg Thr Gly 
2125 2130 2135 

ATT GGT GTT CAG CTT AAA GAC ATC AAG GTC TTC AAC AGG GAC AGG CAG 
o?iX Jisp He Lys Val Phe Asn Arg Asp Arg Gin 

2140 2145 • 2150 

AAG GGT ACC AAT GTG TGC GCG* 6TA GCC MC GGC GGG TGC CAG CAG CTC 
Lys Gly Thr Asn Val Cys Ala Val Ala Asn Gly Gly Cys Gin Gin Leu 
2155 2160. 2165 . 2170 

TGC TTG TAT CGG GGT GGC GGA CAG CGA GCC T6T GCC TGT GCC CAC GGG 
Cys Leu Tyr Arg Gly Gly Gly Gin Arg Ala Cys Ala Cys Ala His Gly 
2175 2180 2185 

ATG CTG GCA GAA GAC GGG GCC TCA TGC CGA GAG TAC GCT GGC TAC CTG 
Met Leu Ala Glu Asp Gly Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu 
2190 2195 2200 

CTC TAC TCA GAG CGG ACC ATC CTC AAG AGC ATC CAC CTG TCG GAT GAG 
Leu Tyr Ser Glu Arg Thr He Leu Lys Ser He His Leu Ser Asp Glu 
2205 2210 2215 

CGT AAC CTC AAC GCA CCG GTG CAG CCC TTT GAA GAC CCC GAG CAC ATG 
Arg Asn Leu Asn Ala Pro Val Gin Pro Phe Glu Asp Pro Glu His Met 
2220 2225 2230 

AAA AAT GTC ATC GCC CTG GCC TTT GAC TAC CGA GCA GGC ACC TCC CCG 
Lys Asn Val He Ala Leu Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro 
^^"^ 2240 2245 • 2250 



2235 



GGG ACC CCT AAC CGC ATC TTC TTC AGT GAC ATC CAC TTT GGG AAC ATC 
Gly Thr Pro Asn Arg He Phe Phe Ser Asp He His Phe Gly Asn He 
2255 2260 2265 

CAG CAG ATC AAT GAC GAT GGC TCG GGC AGG ACC ACC ATC GTG GAA AAT 
Gin Gin He Asn Asp Asp Gly Ser Gly Arg Thr Thr He VaJ. Glu Asn 
2270 2275 2280 

GTG GGC TCT GTG GAA GGC CTG GCC TAT CAC CGT GGC TGG GAC ACA CTG 
Val Gly Ser Val Glu Gly Leu Ala Tyr His Arg Gly Trp Asp Thr Leu 
2285 2290 2295 

TAC TGG ACA AGC TAC ACC ACA TCC ACC ATC ACC CGC CAC ACC GTG GAC 
Tyr Trp Thr Ser Tyr Thr Thr Ser Thr He Thr Arg His Thr Val Asp 
2300 • 2305 2310 

CAG ACT CGC CCA GGG GCC TTC GAG AGG GAG ACA GTC ATC ACC ATG TCC 
Gin Thr Arg Pro Gly Ala Phe Glu Arg Glu Thr Val He Thr Met Ser 
2315 2320 2325 2330 

GGA GAC GAC CAC CCG AGA GCC TTT GTG CTG GAT GAG TGC CAG AAC CTG 
Gly Asp Asp His Pro Arg Ala Phe Val Leu Asp Glu Cys Qln Asn Leu 
2335 2340 2345 

ATG TTC TGG ACC AAT TGG AAC GAG CTC CAT CCA AGC ATC ATG CGG GCA 
Met Phe Trp Thr Asn Trp Asn Glu Leu His Pro Ser He Met Arg Ala 
2350 2355 2360 
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IS ss^i ^ z ss j; s jn ^ 

2375 

?S 5S J^J ^ S SS J2 S= S« ?f f»« 

2380 ■ ^ ^9 Glu Lys Leu Tyr Phe 

2385 2390 

2405 2410 

lis SrS S5 ne 1"=^ ^ 

Ar« Tyr Val lie Leu Lys Ser Glu Pro Val His Pro Phe Gly Leu 

2415 2420 2425 

IS JS ?S ^ JS S= ?S 5S IS I^ S 

2435 2440. 

5S Si Sf '^^^ ^'^C CTG CTT CGG 

Val Gin Ala Asn Lys Tyr Val Gly Ser Asp Met Lys. Leu Leu Arg 

2450 2455 

ill So ill Pro 111 efi p^^^^ ^""^ ^"^^ GT6 GCC AAT GAC 

2460 "^'^ Val Ala Asn Asp 

2465 2470 

JSr f ^ ^^"^ ''^^ ^<5C CGT ATC AAC AAT 6GA GGC T6C 

Thr Asn Ser Cys Glu Leu Ser Pro Cys Arg lie Asn Asn ciy cly Cys 
2480 2485 2490 

CAG GAT CTG T6T CTG CTC ACC CAC CAA GGC CAC GTC AAC TGT TCC TGT 
Gin Asp Leu Cys Leu Leu Thr His Gin Gly vll JJn Jys Ser Jys 

2500 2505 

SS SS ??f f ^ rf ^'^'^ '^^C C<56 6CT GTG AAC 

Arg Gly eiy Arg He Leu Gin Glu Asp Phe Thr Cys Arg Ala Val Asn 

2510 2515 2520 

TCC TCT TGT CGG 6CA CAA GAT GAG TTT GAG TGT GCC AAT GGG GAA TGT 
Ser Ser Cys Arg Ala Gin Asp Glu Phe Glu Cys aS iTy gJS gs ' 
2525 2530 2535 

ill JIf ^ t'''' ^'^C AAG GAC AAG 

alio 2545 ^ ^^%550 ^y^* ^y"* 

2r So «u fOJ f ^ 2'''' ^''^ CGC CGC TGC AAG AAG ACT 

Ser Asp Glu Lys Pro Ser Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr 

ZSoO 2565 2570 

Jko AAC AAT GGC CGC icT .GTA TCC AAC ATG CTG TGG TGC 

Phe Arg Gin Cys Asn Asn Gly Arg Cys Val Ser Asn Het Leu Trp Cys 
2S75 2580 2585 

AAT GGG GTG GAT TAC TGT GGG GAT GGC TCT GAT GAG ATA CCT TGC AAC 
Asn Gly Val Asp Tyr Cys Gly Asp Gly Ser Asp Glu He Pro Cyi Asn 
2590 2595 2600 
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AAG ACT GCC TGT GGT GTG GGT GAG TTC C6C TGC CGG GAT GGG TCC TGC 8295 
Lys Thr Ala Cys Gly Val Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys 
2605 2610 2615 

ATC GGG AAC TCC AGT. C6C TGC AAC CAG TTT GTG GAT TGT GAG GAT GCC 8343 
lie Gly Asn Ser Ser Arg. Cys Asn Gin Phe Val Asp Cys Glu Asp Ala 
2620 2625 2630 

TCG GAT GAG ATG AAT TGC AGT GCC ACA GAC TGC AGC AGC TAT TTC CGC 8391 " 
Ser Asp Glu Met Asn Cys Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg 
2635 2640. 2645 2650 

CTG GGC GTG AAA GGT GTC CTC TTC CAG CCG TGC GAG CGG ACA TCC CTG 84.39 
Leu Gly Val Lys Gly Val Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu 
2655 2660 2665 

TGC TAC 6CA CCT AGC TGG GTG TGT GAT GGC GCC AAC GAC TGT GGA GAC 8487 
Cys Tyr Ala Pro Ser Trp Val Cys Asp Gly Ala Asn Asp Cys Gly Asp 
267 0 2675 2680 

TAC AGC GAT GAA CGT GAC TGT CCA GGT GTG AAG CGC CCT AGG TGC CCG 8535 
Tyr Ser Asp Glu Arg Asp Cys Pro Gly Val Lys Arg Pro Arg Cys Pro 
2665. 2690 2695 

CTC AAT TAC TTT GCC TGC CCC AGC GGG CGC TGT ATC CCC ATG AGC TGG 8583 
Leu Asn Tyr Phe Ala Cys Pro Ser Gly Arg Cys He Pro Met Ser Trp 
2700 2705 2710 

ACG TGT GAC AAG GAG GAT GAC TGT GAG AAC GGC GAG GAT GAG ACC CAC 8631 
Thr Cys Asp Lys Glu Asp Asp Cys Glu Asn Gly Glu Asp Glu Thr His 
2715 2720 2725 2730 



TGC AAC AAG TTC TGC TCA GAG GCA CAG TTC GAG TGC CAG AAC CAC CGG 
Cys Asn Lys Phe Cys Ser Glu Ala Gin Phe Glu Cys Gin Asn His Arg 
2735 2740 2745 



8679 



TGT ATC TCC AAG CAG TGG CTG TGT GAC GGT AGC GAT GAT TGC GGG GAT 
Cys He Ser Lys Gin Trp Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp 
2750 2755 2760 



8727 



GGC TCC GAT GAG GCA GCT CAC TGT GAA GGC AAG ACA TGT GGC CCC TCC 
Gly Ser Asp Glu Ala Ala His Cys Glu Gly Lys Thr Cys Gly Pro Ser- 
2765 2770 2775 



8775 



TCC TTC TCC TGT CCC GGC ACC CAC GTG TGT GTC CCT GAG CGC TGG CTC 
Ser Phe Ser Cys Pro Gly Thr His Val Cys Val Pro Glu Arg Trp Leu 
2780 2785 2790 



8823 



TGT GAT GGC GAC AAG GAC TGT ACC GAT GGC GCG GAT GAG AGT GTC ACT 
Cys Asp Gly Asp Lys Asp Cys Thr Asp Gly Ala Asp Glu Ser Val Thr 
2795 2800 2805 2810 



8871 



GCT GGC TGC CTG TAC AAC AGC ACC TGT GAT GAC CGT GAG TTC ATG TGC 
Ala Gly Cys Leu Tyr Asn Ser Thr Cys Asp Asp Arg Glu ^he Met Cys 
2815 2820 2825 



8919 



CAG AAC CGC TTG TGT ATT CCC AAG CAT TTC GTG TGC GAC CAT GAC CGT 
Gin. Asn Arg Leu Cys He Pro Lys His Phe Val Cys Asp His Asp Arg 
2630 2835 2840 
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6AC TGT GCT GAT GGC TCT GAT GAA TCC CCT GAG TGT GAG TAG CCA ACC 901S 
Asp Cys Ala Asp Gly Ser Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr 
2845 2850 2855 

TGC 6GG CCC AAT GAA TTC CGC TGT GCC AAT GGG C6T TGT CTG A6C TCC 9063 
Cys Gly . Pro Asn Glu Phe Arg Cys Ala Asn Gly Arc Cys Leu Ser Ser 
2860 2865 2870 

CGT CAG TGG GAA TGT GAT GGG GAG AAT GAC TGT CAC GAC CAC AGO GAT 9111 
Arg Gin Trp Glu Cys Asp Gly Glu Asn Asp Cys His Asp His Ser Asp 
2875 2880. 2885 2890 

GAG GCT CCC AAG AAC CCA CAC TGC ACC AGC CCA GAG CAC AAA TGC AAT 9159 
Glu Ala Pro Lys Asn Pro His Cys Thr Ser Pro Glu His Lys Cys Asn 
2895 2900 2905 

GCC TCA TCA CAG TTC CTG TGC AGC AGC GGG CGC TGC GTG GCT GAG GCG 9207 
Ala Ser Ser Gin Phe Leu Cys Ser Ser Gly Arg Cys Val Ala Glu Ala 
2910 2915 2920 

TTG CTC TGC AAC GGC CAG GAC GAC TGT GGG GAC GGT TCA GAC GAA CGC 9255 
Leu Leu Cys Asn Gly Gin Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg 
2925 2930 2935 

GGG TGC CAT GTC AAC GAG TGT CTC AGC CGC PAQ CTC AGT GGC TGC AGT 9303 
Gly Cys His Val Asn Glu Cys Leu Ser Arg Lys Leu Ser Glv Cys Ser 
2940 2945 2950 

CAG GAC TGC GAG GAC CTC AAG ATA GGC TTT MG TGC CGC TGT CGC CCG 9351 
Gin Asp Cys Glu Asp Leu Lys He Gly Phe Lys Cys Arg Cys Arg Pro 
2955 2960 2965 2970 

GGC TTC CGG CTA AAG GAC GAT GGC AGG ACC TGT GCC GAC CTG GAT GAG 9399 
Gly Phe Arg Leu Lys Asp Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu 
2975 2980 2985 

TGC AGC ACC ACC TTC CCC TGC AGC CAG CTC TGC ATC AAC ACC CAC GGA 9447 . 
Cys Ser Thr Thr Phe Pro 'Cys Ser Gin Leu Cys He Asn Thr His Gly 
2990 2995 3000 

AGT TAC AAG TGT CTG TGT GTG GAG GGC TAT GCA CCC CGT GGC GGT GAC 9495 
Ser Tyr Lys Cys Leu Cys Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp 
3005 3010 3015 

CCC CAC AGC TGC AAA GCT GTG ACC GAT GAG GAG CCA TTT CTC ATC TTT • 9543 
Pro His Ser Cys Lys Ala Val Thr Asp Glu Glu Pro Phe Leu He Phe 
3020 3025 3030 

GCC AAC CGG TAC TAC CTG CGG AAG CTC AAC CTG GAC GGC TCC AAC TAC 9591 
Ala Asn Arg Tyr Tyr Leu Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr 
3035 3040 3045 3050 

ACA CTG CTT AAG CAG GGC CTG AAC AAT GCG GTC GCC TTG GCA TTT GAC 9639 
Thr Leu Leu Lys Gin Gly Leu Asn Asn Ala Val Ala Leu Ala Phe Asp 
3055 3060 ' 3065 

TAC CGA GAG CAG ATG ATC TAC TGG ACC GGC GTG ACC ACC CAG GGC AGC 9687 
Tyr Arg Glu Gin Met He Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser 
3070 3075 3080 
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ATG ATT CGC AGG ATG CAC CTC AAr am b^.- 
Met He Arg Arg Met His Leu P^n ^ ^ ^ 

3085 3aIa Gin Val Leu His 

3095 

C6G ACG GGC CTT AGT AAC C<?a mm i-^^ ^ 

Arg Thr Gly Leu Ser AsS SI If? ^'^^ G^T 

3100 3iSs ^ '^^^ '^P Val Gly 

•'^OS 3110 

GGC AAC CTG TAC TGG TGT GAC AAr" rr^ 

Gly Asn Leu Tyr Trp Cys Aso lJs g?5 ^^'^ ^CC 

3115 3i5o ^ lie Gla Val Ser 

3125 3130 

2J JSS G?5 SS ?° e^? 

3135 ^ ^^i*!!! Ser Gly Leu Arg 

3140 3145 

3150 I-eu Tyr Trp 

3155 3150 

?S j| Z M S S5 S= K S S 



3175 



9735 



9783 



9831 



9879 



9975 



10023 



IT. J ^. ?X K ?S 152 ?S Z 
,i S ^ - J - - - gc^c. 

3220 3225 
3230 3235 3240 



Z J Zl^^ LI j| SSI Jg - S 

^ 3265 3270 . 

^ «I? Sp llu Hil f ^ ^^'^^ C<=A e«3 10311 

3275 ^ %55n ^ ^''^ "^'^ Gin Pro Asp Val 

3285 3290 

^295 3300 - 3305 ^ 

CTG CTG TCC <X:T GGG GGT GGT CAC AAG TGC GCC TGC CCC ACC AAC TTC 10407 
Leu Leu Ser Pro Gly Gly Gly His Lys Cys Ala cjs So ?S Z 

3315 3320 
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SS^I IS Z ^, 5S S J^S ?S Si 

3330 3335 

Sn SI SI? r*^"" ''^^ ATC CCC TTC TGG TGG AAG TGT 

Mis '''' ""^^ "^"^^f P~ Trp irp IVs 

'^'^^w 3345 335Q 

Sd ?hr G?S Sn S^'S r^^ '"'^ GAG CCT CCA GAC TGT 

Asp Thr Glu Asp Asp Cys Gly Asp Mis Ser Asp Glu Pro Pro Asp Cys 
3360 3365 *'33^j, 

CCC GAG TTC AAG T6C OGC CCA G6C GAG TTC CAG TGC TCC ACC GGC ATC 
Pro Glu Phe Lys Cys Arg Pro Gly Gin Phe Sn l^s sS JS G?J 

3380 33g| 

S5 ?S III ^^'^ ^'^^ CAA GAC 

cys Thr Asn Pro Ala Phe lie Cys Asp Gly Asp Asn Asp Cys Gin Asp 

3395 3400 

sll 2o 111 5?J 1°^ ^ *fG«= 'TG CCC AGC CAA 

'^'^ Val Cys Leu Pro Ser Gin 

3410 34X5 

III iSs i^n ^ ^ '^^''^ ■'^^ <^^C ATC TTC CGT TGC 

^ ^'*',^f? ^''^ Cys He Pro Gly lie Phe Arg Cys 

3425 • 3430 

t^"^ f J^"" ^''C GGC GAG GAT GAG CGG GAT TGC CCT 
Asn Gly Gin Asp Asn Cys Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro 
3440 3445 3,50 

Gil SI? r^"" l"""" ^C CAG TTC CAG TGC TCC ATC ACC AAG C6C 
Glu Val Thr Cys Ala Pro Asn Gin Phe Gin Cys Ser He Thr tys Aro 
3«SS 34 go 3465 

TGC ATC CCT CGC GTC TGG GTC TGT GAC AGG GAT AAT CAC TGT GTG GAC 
Cys He Pro Arg Val Trp Val Cys Asp Arg Asp Asn His Cys Val Asp 
3<70 3475 34I0 

GGC AGT GAT GAG CCT GCC AAC TGT ACC CAA ATG ACC TGT 6GA GTG GAT 
Gly Ser Asp Glu Pro Ala Asn Cys Thr Gin Met Thr Cys Gly Val Asp 
3485 3490 3495 

GAG TTC CGC TGC AAG GAT TCT GGC CGC TGC ATC CCC GCG CGC TGG AAG 

ocn« ^^'^ ^9 ^« lie Pro Ala Arg Trp Lys 

Jooo 3505 3510 

TGT GAC GGA GAA GAT GAC TGT GGG GAT GGT TCA GAT GAG CCC AAG GAA 
Cys Asp Gly Glu Asp Asp Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu 
3520 3525 3530 

GAG TGT GAT GAG CGC ACC TGT GAG CCA TAC CAG TTC CGC TGC AAA AAC 
Glu Cys Asp Glu Arg Thr Cys Glu Pro Tyr Gin Phe Arg gys Lys Asn 
3535 3540 3545 

AAC CGC TGT GTC CCA GGC CGT TGG CAA TGT GAC TAC GAC AAC GAC TGC 
Asn Arg Cys Val Pro Gly Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys 
3550- 3555 3sgQ 
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Z£ 12 S SI 12 ss 

" 3570 3575 

Ser lie f ^ ?f CGC TGC ATC GCT GGG 06C TGG AAG 

3580 ^ 3585 ^ ^ "^SM? '^^ 

TGT GAT 6GG 6AC CAT 6AC TGT GCC GAC GGC TCA GAC GAG AAA GAC TGC 
Cys Asp Gly Asp His Asp Cys Ala Asp Gly Ser A^p ^ 

3600. 3605 3610 

AOC CCC CGC TGT GAT ATG GAC CAG TTC CA6 TGC AAG AGT GGC CAC TGC 
Thr Pro Arg Cys Asp Met Asp Gin Phe Gin Cys Lys Ser ?ys 

3620 '3625 

50 SS So III f ° ^^'^ ^'^^ '^^^ ^'^C TGT ATG GAC GGC 
lie Pro leu Arg Trp Pro Cys Asp Ala Asp Ala Asp Cys Met Asp Gly 

3635 3640 

AGT GAC GAG GAA GCC TGT GGC ACT GGG GTG AGG ACC TGC CCA TTG GAT 
Ser Asp Glu Glu Ala Cys Gly Thr Gly Val Arg Thr C^s Pro Isp 
3645 3650 3655 

cfS SI S?? f ^^"^ '^^ GCC TGG AAG TGT 

3660 3665 ^'°3670 "^'^ 

SI G?J 2?J aJo f^'^ J'^^ ''''^ «AG AAC CCC GAG GAA 

Asp Gly Glu Asp Asp Cys Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu 
3680 3685 3690 

lis 111 S« t];^ S*^' ^ «^ CGC TGC AAG AAT 

Cys Ala Arg Phe lie Cys Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn 
36S5 3700 3705 

51 vl? ''V ^'^G GAT GGC GTG GAC AAC 
Asp Arg Val Cys Leu Trp lie Gly Arg Gin Cys Asp Gly Val Asp Asn 

3^10 3715 •^3720 

IV. l^l ^GT GAG CCC CCC ACG GCC CAG 

Cys Gly Asp Gly Thr Asp Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin 
3725 3730 3735 

AAC CCC CAC TGC AAA GAC AAG AAG GAG TTC CT6 TGC CGA AAC CAG CGC 

\^An " Asp lys Lys Glu Phe Leu Cys Arg Asn Gin Arg 

3'40 3745 37SQ 

SI P° TTC GAT GAC TGC GGC GAT 

375? 37lS ^ "**'37M *^^^3770 

Jf^ r?^ T*^"" A'*= G'^C CCC AAG CTG' ACC AOC TGT 

Gly Ser Asp Glu Glu Asp Cys Ser lie Asp Pro Lys Leu Thr Ser Cys 
377S 3780 3785 

GCC ACC AAT.GCC A6C ATG TGT GGG GAC GAA GCT CGT TGT GTG CGC ACT 
Ala Thr Asn Ala Ser Het Cys Gly Asp Glu Ala Arg Cys Val Arg Thr 
3'50 3795 " ' 3800 
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GAG AAA GCT GCC TAG TGT GCC TGC CGC TCG GGC TTC CAT ACT 6TG CCG 

^%?it ^ ^'^ ""^^ ^9 His Thr vS Pro 

3^0^ 3810' 3815 

GGC CAG CCC GGA TGC CAG GAC ATC AAC GAG TGC CTG CGC TTT GGT ACC 
^ '^^%^f ^rg Phe Gly Thr 

.TGC TCT CAG CTC TGG AAC AAA CCC AAG GGA GGC CAC CTC TGC AGC TGT 
Cys Ser Gin Leu Trp Asn Lys Pro Lys Gly Gly His Leu Cys Ser Cys 
3835 3840 3345 3950 

GCC CGC AAC TTC ATG AAG ACA CAC AAC ACC TGC AAA GCT GAA GGC TCC 
Ala Arg Asn Phe Met Lys Thr His Asn Thr Cys Lys Ala GLu Gly Ser 
3855 3860 3865 

GAG TAC CAG GTG CTA TAC ATC GCG GAT GAC AAC GAG ATC CGC AGC TTG 
Glu Tyr Gin Val Leu Tyr lie Ala Asp Asp Asn Glu lie Arg Ser Leu 
3870 3875 3880 

TTC CCG GGC CAC CCC CAC TCA GCC TAC GAG CAG ACA TTC CAG GGC GAT 
Phe Pro Gly His Pro His Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp 
3885 3890 3395 

GAG AGT GTC CGC ATA GAT GCC ATG GAT GTC CAT GTC AAG GCC GGC CGT 
Glu Ser Val Arg lie Asp Ala Met Asp Val His Val Lys Ala Gly Arg 
3900 3905 3910 

GTC TAC TGG ACT AAC TGG CAC ACG GGC ACA ATC TCC TAC AGG AGC CTG 
Val Tyr Trp Thr Asn Trp His Thr Gly Thr He Ser Tyr Arg Ser Leu 
3915 3920 3925 3930 

CCC CCT GCC GCC CCT CCT ACC ACT TCC AAC CGC CAC CGG AGG CAG ATC 
Pro Pro Ala Ala Pro Pro Thr Thr Ser Asn Arg His Ar^ Arg Gin He 
3935 3940 3945 

GAC CGG GGT GTC ACC CAC CTC AAT ATT TCA GGG CTG AAG ATG CCG AGG 
Asp Arg Gly Val Thr His Leu Asn He Ser Gly Leu Lys Met Pro Arg 
3950 3955 3960 

GGT ATC GCT ATC GAC TGG GTG GCC GGG AAT GTG TAC TGG ACC GAT TCC 
GXy lie Ala He Asp Trp Val Ala Gly Asn Val Tyr Trp Thr Asp Ser 
3965 3970 3975 

GGC CGA GAC GTG ATT GAG GTG GCG CAA ATG AAG GGC GAG AAC CGC AAG 
Gly Arg Asp Val He Glu Val Ala Gin Met Lys Gly Glu Asn Arg Lys 
3980 3985 3990 

ACG CTC ATC TCG GGC ATG ATT GAT GAG CCC CAT GCC ATC GTG GTG GAC 
Ji^ Ala He Val Val Asp 

3995 4000 4005 4OIO 

CCT CTG AGG GGC ACC ATG TAC TGG TCA GAC TGG GGG AAC CAC CCC AAG 
Pro Leu Arg Gly Thr Met Tyr Trp ger Asp Trp Gly Asn His Pro Lys 
4015 4020 ' 4025 

ATT GAA ACA GCA GCG ATG GAT GGC ACC CTT CGG GAG ACT CTC GTG CAA 
He Glu Thr 'Ala Ala Wet Asp Gly Thr Leu Arg Glu Thr Leu Val Gin 
4030 4035 4040 
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S5 M ?s M s js ?s s; sjnx 

4 055 

?Jr ?rp a2 So ^^C CGG 

4060 ^ 4^65 ^^45}S 



CTC ARC GGC ACT GAC CCC ATT GTG GCT GCT GAG AGC 
Leu Asn Gly Thr Asp Pro lie Val Ala Ala Asp Ser 
4080 4085 

AGT CAC CCC TIC AGC ATC GAT GTG TTT 6AA GAC TAC 
ser His Pro Phe Ser lie Asp Val Phe Glu Sp Sr 
• 4095 4100 ^ 

ACT TAC ATC AAT AAT CGT GIC TTC AAG ATC CAC AAfi 
Thr Tyr He Asn Asa Arg Val Phe Lys ill ^ 
4110 4115 ' 



AAA CGA GGC CTA 
Lys Arg Gly Leu 
4090 

ATC TAC 6GA GTC 
lie Tyr Gly Val 
4105 

TTT GGA CAC AGC 
Phe Gly His Ser 
4120 



CCC TTG TAC AAC CTA ACT GGG GSC CTG AGC CAT GCC TCT GAT GTA GTC 
Pro Leu Tyr Asn Leu Thr Gly Gly Leu Ser Ss Si III AsJ Sat vll 
^^^3 4130 4135 

Ifr SI G?!i S?^ f ^ ^'^^ CCC TGT GAC CGC 

4^40 Asn Pro Cys Asp Arg 

4145 4150 

AAG AAA TGC GAA TGG CTG TGT CTG CTG AGC CCC AGC GGG OCT GTC TGC 
^Lys Lys Cys Glu Trp Leu Cys Leu Leu Ser Pro Ser gI? i?o vll S?S 

^^^'^ 4165 4170 

III Sr^ ^^"^ ^A"' «5C ACC TGT GTG CCT GTG 

Thr Cys Pro Asn Gly Lys Arg Leu Asp Asn Gly Thr Cys Val Pro Val 

4180 4185 

CCC TCT CCA ACA CCC CCT CCA GAT GCC CCT AGG CCT GGA ACC TGC ACT 
Pro ser Pro Thr Pro Pro Pro Asp Ala Pro Arg pJJ Sy ?S Jys ?hr 

4X95 4200 

CTG CAG TGC TTC AAT GGT GGT AGT TGT TTC CTC AAC GCT rnr unr onr 
Leu Gin cys Phe Asn Gly Gly Ser Cys Z SS JJS AU S^J 
4205 4210 4215 

^« f ^ ^ ^ ''^ *CA GGC GAT AAG TGT GAG CTG 

4220 4225 4230 

GAT CAG TGC TGG GAA TAC TGT CAC AAC GGA GGC ACC TGT GCG GCT TCC 
Asp Gin Cys Trp Glu Tyr Cys His Asn Gly Gly Thr Cys Ala lOa Ser 
4240 4245 4250 

P^i If I of u^? CCC ACT GGC TTC ACG GGC CCC 

Pro ser Gly Met Pro Thr Cys Arg Cys Pro.Thr Gly Phe Thr Gly So 
4255 4260 - 4265 

^ Sf T^ t?" S^? l^'' ^ TCT AAC AAC AGC ACC 

Lys Cys Thr Ala Gin Val Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr 
4270 4275 ^gSO 
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TGC ACC GTC AAC CAG GGC AAC CAG CCC CAG TGC CGA TGT CTA CCT 66C 13335 
Cys Thr Val Asn Gin Gly Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly 
4285 4290 4295 

TTC CTG GGC GAC CGT TGC CAG TAG C6G CAG TGC TCT GGC TTC TGT GAG 13383 
Phe Leu Gly Asp Arg Cys Gin Tyr Arg Gin Cy« Ser Gly Phe Cys Glu 
4300 4305 • 4310 

AAC TTT GGC ACC TGT CAG ATG GCT GCT GAT GGC TCC CGA CAA TGT CGC 13431 
Asn Phe Gly Thr Cys Gin Met Ala Ala Asp Gly Ser Arg Gin Cys Arg 
4315 4320 , 4325 4330 

TGC ACC GTC TAC TTT GAG GGA CCA AGG TGT GAG GTG AAC AAG TGT AGT 13479 
Cys Thr Val Tyr Phe Glu Gly Pro Arg Cys Glu Val Asn Lys Cys Ser 
4335 4340 4345 



CGC TGT CTC CAA GGC GCC TGT GTG GTC AAT AAG CAG ACC GGA GAT GTC 
Arg Cys Leu Gin Gly Ala Cys Val Val Asn Lys Gin Thr Gly Asp Val 
4350 4355 4360 



13527 



ACA TGC AAC TGC ACT GAT GGC CGG GTA GCC CCC AGT TGT CTC ACC TGC 
Thr Cys Asn Cys Thr Asp Gly Arg Val Ala Pro Ser Cys Leu Thr Cys 
4365 4370 4375 



13575 



ATC GAT CAC TGT AGO AAT GGT GGC TCC TGC ACC ATG AAC AGC AAG ATG 
lie Asp His Cys Ser Asn Gly Gly Ser Cys Thr Met Asn Ser Lys Met 
4380 4385 4390 



13623 



ATG CCT GAG TGC CAG TGC CCG CCC CAT ATG ACA GGA CCC CGG TGC CAG 
Met ' Pro Glu Cys Gin Cys Pro Pro His Met Thr Gly Pro Arg Cys Gin 
4395 4400 . 4405 4410 



13671 



GAG CAG GTT GTT AGT CAG CAA CAG CCT CGG CAT ATG GCC TCC ATC CTG 
Glu Gin Val Val Ser Gin Gin Gin Pro Gly His Met Ala Ser lie Leu 
4415 4420 4425 



13719 



ATC CCT CTG CTG CTG CTT CTC CTG CTG CTT CTG GTG GCT* GGC GTG GTG 
lie Pro Leu Leu Leu Leu Leu Leu Leu Leu Leu Val Ala Gly Val Val 
4430 4435 4440 



13767 



TTC. TGG TAT AAG CGG CGA GTC CGA GGG GCT AAG GGC TTC CAG CAC CAG 
Phe Trp Tyr Lys Arg Arg Val Arg Gly Ala Lys Gly Phe Gin His Gin 
4445 4450 4455 



13815 



CGG ATG ACC AAT GGG GCC ATG AAT GTG GAA ATT GGA AAC CCT ACC TAC 13863 
Arg Met Thr Asn Gly Ala Met Asn Val Glu lie Gly Asn Pro Thr Tyr . ,r 
4460 4465 4470 



AAG ATG TAT GAA GGT GGA GAG CCC GAT GAT GTC GGG GGC CTA CTG GAT 
Lys Met Tyr Glu Gly Gly Glu Pro Asp Asp Val Gly Gly Leu Leu Asp 
4475' 4480 4485 4490 



13911 



GCT GAT TTT GCC CTT GAC CCT GAC AAG CCT ACC AAC TTC ACC AAC CCA 
Ala Asp Phe Ala Leu Asp Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro. 

4495 4500 ' 4505 



13959 



GTG TAT GCC ACG CTC -TAC ATG GGG GGC CAC GGC AGC CGC CAT TCC CTG 
Val Tyr Ala Thr Leu Tyr Met Gly Gly His Gly Ser Arg His Ser Leu 
4510 4515 4520 



14007 
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GCC AGC ACG GAC GAG AAG C6A GAA CTG CTG G6C CGG GGA CCT GAA GAC 14 055 
Ala Ser Thr Asp Glu Lys Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp 
4525 4530 4S3S 

GAG ATA GGA GAT CCC TTG GCA TAGGGCCCTG CCqCGACGGA TGTCCCCAGA AAGC 14110 

CCCCTGCCAC AT6AGTCTTT CAATGAApCC CCTCCCCAGC CGGCCCTTCT CCGGCCCTGC 14170 
Glu lie Gly Asp Pro Leu Ala * . 
4540 4545 

CGGGTGTACA AATGTAAAAA TGAAGGAATT ACTTTTTATA TGTGAGCGAG CAAGCGA6CA 14230 

AGCACAGTAT TATCTCTTTG CATTTCCTTC CTGCCTGCTC CTCAGTATCC CCCCCATGCT 14290 

GCCTTGAGGG GGCGGGGAGG GCTTTGTGGC TCAAAGGTAT GAAGGAGTCC ACATGTTCCC 14350 

TACCGAGCAT ACCCCTGGAA GCCTGGCGGC ACGGCCTCCC CACCACGCCT GTGCAAGACA 14410 

CTCAACGGGG CTCCGTGTCC CAGCTTTCCT TTCCTTGGCT CXCTG6GGTT AGTTCA66GG 14 470 

AGGTGGAGTC CTCTGCTGAC CCTGTCTGGA AGATTTGGCT CTAGCTGAGG AAGGAGTCTT 14530 

TTAGTTGAGG GAAGTCACCC CAAACCCCAG CTCCCACTTT CA6GGGCACC TCTCAGATGG 14590 

CCATGCTCAG TATCX:CTTCC AGACAGGCOC TCCCCTCTCT AGCGCCCCCT CTGTGGCTCC 14650 

TAGGGCTGAA CACATTCTTT GGTAACTGTC CCCCAAGCCT CCCATCOCOC TGAGGGCCAG 14710 
GAAGA6TCGG GGCACACCAA GGAAGGGCAA GCGGGCAGCC CCATTTTGGG GACGTGAACG 14770 
TTTTAATAAT TTTTGCTGAA TTCCTTTACA ACTAAATAAC ACAGATATTG TTATAAATAA 14830 
AATTGTAAAA AAAAAAAAA 
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lie «yr His Gin Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu 
465 470 <75 480 

Asn Asp Gin Tyr Gly Lys Pro Gly Gly Cys Ser Asp lie Cys Leu Leu 

485 . 490 495 

Ala Asn Ser His Lys Ala Arg Thr Cys Arg Cys Arg'Ser Gly Phe Ser 

500 505 510 

Leu Gly. Ser Asp Gly. lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe 

515 520 525 . 

Leu Val Tyr Gly Lys Gly Arg Pro Gly lie He Arg Gly Met Asp Met 

530 535 540 

Gly Ala Lys Val Pro Asp Qlxx His Met He Pro He Glu Asn Leu Met 
545 550 555 560 

Asn Pro Arg Ala Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe 

565 570 575 

Ala Asp Thr Thr Ser Tyr Leu He Gly Arg Gin Lys He Asp Gly Thr 

580 585 590 

Glu Arg Glu Thr He Leu Lys Asp Gly He His Asn Val Glu Gly Val 

595 600 605 

Ala Val Asp Trp Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro 

610 615 620 

Lys Lys Thr He Ser Val Ala Arg Leu Glu Lys Ala Ala Gin Thr Arg 
625 630 635 640 

Lys Thr Leu He Glu Gly Lys Met Thr His Pro Arg Ala He Val Val 

645 650 655 

Asp Pro Leu Asn Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro 

660 665 670 

Lys Asp Ser Arg Arg Gly Arg Leu Glu Arg Ala Trp Met Asp Gly Ser 

675 680 685 

His Arg Asp He Phe Val Thr Ser Lys Thr Val Leu Trp Pro Asn Gly 

690 695 700 

Leu Ser Leu Asp He Pro Ala Gly Arg Leu Tyr Trp Val Asp Ala Phe 
70S 710 , 715 720 

Tyr Asp Arg He Glu Thr He Leu Leu Asn Gly Thr Asp Arg Lys He 

725 730 735 

Val Tyr Glu Gly Pro Glu Leu Asn His Ala Phe Gly Leu Cys His His 

740 745 750 

Gly Asn Tyr Leu Phe Trp Thr Glu Tyr Arg Ser Gly Ser Val Tyr Arg 

755 760 765 

Leu Glu Arg Gly Val Ala Gly Ala Pro Pro Thr Val Thr Leu Leu Arg 

770 775 780 

Ser Glu Arg Pro Pro He Phe Glu He Arg Met Tyr Asp Ala His Glu 
785 790 795 800 

Gin Gin Val Gly Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser 

805 810 815 

Ser Leu Cys Leu Ala Thr Pro Gly Ser Arg Gin Cys Ala Cys Ala Glu 

820 825 830 

Asp Gin Val Leu Asp Thr Asp Gly. Val Thr Cys Leu Ala Asn Pro Ser 

835 840 845.. 

Tyr Val Pro Pro Pro Gin Cys Gin Pro Gly Gin Phe Ala Cys Ala Asn 

8S0 855 860 

Asn Arg Cys He Gin Glu Arg Trp Lys Cys Asp Gly Asp Asn Asp Cys 
86S ^ 870 875 880 

Leu Asp Asn Ser Asp Glu Ala Pro Ala Leu Cys His Gin His Thr Cys 

885 890 « 895 

Pro Ser Asp Arg Phe Lys Cys G^Lu Asn Asn Arg .Cys He Pro Asn Arg 

900 905 910 

Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly Asn Ser Glu Asp Glu Ser 
915 920 925 
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Asn Ala Thr Cys Ser Ala Arg Thr Cys Pro Pro Asn Gin Phe Ser Cys 

930 935 940 

Ala Ser Gly Arg Cys lie Pro He Ser Trp Thr Cys Asp Leu Asp. Asp 
945 950 955 960 

Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala Ser Cys Ala Tyr Pro Thr 

965 970 975 

Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys He - Asn 

980 985 990 

He Asn Trp Arg Cys Asp Asn Asp Ash Asp Cys Gly Asp Asn Ser Asp 

995 1000 1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 1020 

Ser Gly Arg Cys He Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp 
025 1030 1035 *^ 1040 

Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu- Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys He Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 1100 

Cys Asp Pro Asn Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys He 
105 1110 1115 1120 

Ser Lys .Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 

1125 1130 1135 

Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

.1140. 1145 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 1180 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
185 1190 1195 1200 

Pro Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg. Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe He He Phe Ser Asn Arg His Glu He 
265 1270 1275 1280 

Arg Arg He Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr He Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 , 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu 

1315 1320 1325 

Asp Asn Gly Ala Leu Thr Ser Phe Glu Val Val He Gin Tyr Gly Leu 

1330 1335 1340 

Ala Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr 
345 1350 1355 1360 

Trp Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly 



Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala - 

1380 1385 1390 

He Ala Leu Asp Pro Arg Asp Gly He Leu Phe Trp Thr Asp Trp Asp 



1365 



1370 



13T5 
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1395 1400 1405 

Ala Ser Met Ser Gly Ala Gly Arg 
1410 1415 1420 • 

Arg Thr lie His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
425 1430 1435 '1440 

Thr Val Asp Tyr Leu Glu Lys Arg lie Leu Trp lie Asp Ala Arg Ser 
n ^^^^ 1450 1455 

Asp Ala lie Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

.Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
505 ^ ^ 1510 1515 1520 

Thr Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met 
M n . 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 15<5 1550 

Leu Cys Leu He Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu lie Arg Gly Val Asp Leu Asp 
585 1590 1S95 1600 

Ala Pro Tyr Tyr Asn Tyr He He Ser Phe Thr Val Pro Asp He Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala He hys Arg Ala Phe lie Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu 

1650 1655 1660 

Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
665 1670 1675 1680 

Asn Lys Lys Gin He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 1695 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala 

1715 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp He 
745 1750 1755 1760 

Ser Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 1775 

Leu Glu Val He Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala He Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 • 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr ^?p Glu Ser 
S25 1830 183S 1840 

He Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 

1845 1850 1855 

Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys • 
I860 1865 1870 
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Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 

1875 1880 1885 

Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly He Ara Glv 

1890 1895 1900 

He Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser 
905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr 



Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val 

1955 ■ 1960 1965 

Glu Gly He Ala Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp 

1970 1975 1980 ' *^ 

Gin Gly Phe Asp Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
585 1990 1995 2000 

Tyr Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val 

2005 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Prp 

2020 2025 2030 

Arg He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Gly 

2050 2055 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
"5 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 2185 2190 

Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

He Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu 
225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg lie 

2245 2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn He Gin Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 .2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
305 2310 2315 2320. 

Phe Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg 

2325 2330 2335 

Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 



1925 

He Tyr Trp Val Asp Met Gly Leu 
1940 



1930 1935 
L Ser Thr He Ser Arg Ala Lys Arg 
1945 1950 



FIG. 12B 



wo 01/92474 PCT/USOl/18041 

46/91 



2340 2345 o-^^n 

Asn Gl« Leu His Pro Ser lie Met Arg Ala Ala Leu Ser GlJ Ala Asn 

2360 

Val Leu Thr Leu lie Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala 

2380 

lie Asp His Arg Ala Glj, Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp 

2395 

Lys lie Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu 

2410 

Lys Ser Glu^Pro Val His Pro Phe^Gly Leu Ala Val Tyr Gly Glu His 

lie Phe Trp Thr Asp Trp Val Ar/^| Ala Val Gin Arg^Jl! Asn Lys 
^'•'^ 2440 244S 

''''^Tso "'^'^ ^ Asp lie Pro Gin Gin 

2455 2460 
Pro Met Gly He lie Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
2470 2475 9dfln 

ser Pro Cys Arg lie Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 

2485 2490 24^5 

Thr His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu 

2505 2510 
Gin Glu Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2515 2520 2525 

Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys He Ser Phe Ser Leu Thr 

2530 ' 2535 2540 

Cys Asp Gly Val Ser Mis Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
2550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 

, 2565 2570 2575 

Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Gly Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val 
*595 2600 2605 

^^^oV;^ ^""^ ^""^ ^ly Asn Ser Ser Arg 

2610 2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 
f25 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Uu Gly Val Lys Gly Val 

„^ , 2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 

2660 2665 2670 

Val Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp 
_ 2675 2680 2685 

^^,1 S« ^9 Pro Leu Asn Tyr Phe Ala Cys 

2690 2695 2700 

Pro Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp 
^05 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp 
, ^ 2740 2745 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 

2755 2760 2765 

"^*«Syf ser Ser Phe Ser Cys Pro Gly 

^ 2770 2775 2780 

Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp 
^ .2790 2795 2800 

Cys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 
2805 2810 2815 
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Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Aan Arg Leu Cys lie 

2320 2825 2830 

Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asn Glu Phe 

2850 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
865 2870 . 2875 2880 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 .2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
945 2950 2955 2960 

Lys He Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp 

2965 2970 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2980 2985 2990 

Cys Ser Gin Leu Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 

2995 3000 3005 

Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu He Phe Ala Asn Arg Tyr Tyr Leu 
025 3030 3035 3040 

Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp Tyr Arg Glu Gin Met He 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met He Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
105 3110 3115 3120 

Asp Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145- 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu He Gly Arg He Gly Met Asp Gly Ser Gly Arg Ser He He 

3170 3175 .... 3180 

Val Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Val Asp Tyr Val 
185 3190 3195 3200 

Thr Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He 

3220 3225 3230 

Pro His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 

3235 3240 3245 . 

Asp Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His 
265 3270 3275 3280 

Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys 
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3290 3295 
Val Asn Asn Gly Gly Cys Ser Asn leu Cys Leu Leu Ser Pro Gly Gly 

3305 3310 
Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly 

3320 332S 

^^3l5S ^'",?^t ^^'^ '^^^ ^" val cys Lys Asn 

^^-^^ 3335 3340 

Asp Lys cys He Pro. Phe Trp Ttp Lys Cys Asp Thr Glu Asp Asp Cys 

3350 3355 33S0 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 3370 3375 ^ 

Pro Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe 

3385 3390 

lie Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn 

^^35 3400 3405 

^"aJ^o !*" ^^"^ cys Thr Asn Thr 

^*Ag ^ 3415 3420 

Asn Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

•'445 , 3450 3455 

Asn Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp 
„ , „ 3460 3465 34^2 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 

3475 3480 3485 

Asn Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp 

3495 3500 
Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
^, . 3510 3515 3520 

Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

^, „ -^^^^ 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 
3555 3560 356S 

n« '^'"^ ^""^ ^""^ '^y* Ser Glu Ser Glu Phe Phe Cys Ala 
3570 3575 35^^ 

Asn Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp 
3590 3595 3500 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 
„ , 3620 3625 3630 

cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 
3635 3640 3645 

^,1^5 ^^"^9 Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
3670 3 67 5 3680 

Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe He Cys 
- ^ . 3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 

3700 3705 3710 

He Gly Arg Gin Cys Asp Gly Val Asp Asn Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 

3730 3735 3740 

Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
3750 3755 3760 
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Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp 
^ . 3770 3775 

cys Ser He Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

3780 3785 3^^^ 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 
3795 3800 3805 

^^^.Syf Gin Pro Gly Cys Gin 

3810 . 3815 . 3820 

Asp He Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
®25 3830 3835 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3850 3855 

Thr Hts Asn Thr Cys Lys Ala Glu Gly ser Glu Tyr Gin Val Leu Tyr 

3860 3865 3870 

He Ala Asp Asp Asn Glu lie Arg Ser Leu Phe Pro Gly His Pro His 

, 3875 3880 3885 

^^^^iS Asp Glu Ser Val Arg lie Asp 

3890 3895 3900 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp • 
90S 3910 3915 3920 

His Thr Gly Thr lie Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

o ^^^^ 3^3° 3935 

Thr Thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His 

^ 3940 3945 3950 

Leu Asn lie Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp 

3955 3960 3955 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu 

3970 3975 3980 

Val Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met 
585 3990 3995 4000 

He Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met' 

4005 4010 * 4015 

Tyr Trp Ser Asp Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro 

4035 4040 4045 

Thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro 

4070 4075 4080 

He Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg 

4100 4105 4110 

Val Phe Lys He His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 

4150 4155 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 

4165 4170 4175 

Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 

4195 4200 4205 

Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin 

4210 4215 4220 

Pro Atg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu Tyr 



FIG. 12B 



wo 01^2474 PCT/USOl/18041 

50/91 



4230 4235 
Cys His Asn 61y Gly Thr Cys Ala Ma Ser Pro Ser Gly Met Pro T^r 

4250 • 4?55 

cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

i^oxj 4265 4770 

cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 
*2T5 4280 4285 

**"429S ^''^iisl ^" Asp Arg Cys 

Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glo Asn^PhS Gly Thr Cys Gin 

4310 4315 4320 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 

^325 4330 4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 

4340 4345 4350 

cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 

4360 4365 

^^^S '^^^ Asp His cys Ser Asn 

4375 4380 
Gly Gly Ser Cys Thr Met Asn Ser Lys Het Met Pro Glu Cys Gin Cys 

4395 4400 
Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4^05 4410 4415 

Gin Gin Pro Gly His Met Ala Ser lie Leu lie Pro Leu Leu Leu Leu 

4^20 4425 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 
4435 4440 4445 ^ ^ ^ 

^*^^^c? '^^^ P*'^ «" Gin Arg Met Thr Asn Gly Ala 

4450 4455 ^^gQ 

Met Asn Val Glu lie Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 

465 4470 4475 '4485 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 4495 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 

4515 4520 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu 

4530 4535 4540 

Ala 
545 
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GCTACAATCC ATCTGGTCTC CTCCAGCTCC TTCTTTCTGC AAC ATG GGG AAG AAC 

Met Gly Lys Asn 
1 



^ Su l"" f CTC TTG GTC CTC CTG CCC 

Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro 
* 10 15 20 



55 



103 



151 



199 



tir So Sf i vl? c"^- '^^'^ ^'^^ ^TG GTT CTG GTC CCC 

Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro 
25 30 35 

III SS III If ^2"" """^ ^ ^ TGT GTC CTT CTG AGC 

Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser 
40 45 50 

TAC CTG AAT GAG ACA GTG ACT GTA ACT GCT TCC TTG GAG TCT GTC AS6 247 
Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arc 
55 60 65 

GGA AAC AGG AGC CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC 295 
Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu 

75 80' 

CAC TGT GTC 6CC TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA 343 

His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val 
^5 90 95 

ATG TTC CTC ACT GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG 391 

Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys 

105 110 115 

CGG ACC ACA GTG ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG 439 
Argr Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin 
120 125 130 

ACA GAC AAA TCA ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT 487 
Thr Asp Lys Ser lie Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val 
135 140 145 

GTC TCC ATG GAT GAA AAC TTT .CAC CCC CTG AAT GAG TTG ATT CCA CTA 535 
Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu Leu lie Pro Leu 
ISO 155 160 

GTA TAC ATT CAG GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT 583 
Val Tyr lie Gin Asp Pro Lys Gly Asn Arg lie Ala Gin Trp Gin Ser 
1" 170 175 180 

TTC CAG TTA GAG GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA 631 
Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe Prp Leu Ser Ser 
185 190 195 

GAG CCC TTC CAG GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT 679 
Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser" Gly 
200 205 210 

GGA AGG ACA GAG CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG 727 
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* 775 



823 



871 



919 



967 



1015 



Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys 

220 225 

51 §^ s^ Si s s f ^ jg - iji 

235 240 

Su SI? Kn ^^ S'f l^"" ^ '^'^^ TAT 66G AAG CCT 6TC 

Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val 

250 255 . ' 260 

CCT GGA CAT GTG ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GOT TCC 
Pro Gly His Val Thr Val Ser lie Cys Arg Lys Tyr til Sa Ser 
265 270 275 

Sp Ifs 2s f ' ^ WVA TTC AGT GGA 

Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly 

285 290 • 

52 lit u^^ S?^ ^^'^^ CAA GTA AAA ACC AAG GTC 
Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val 

295 300 305 

III ^'^^ ^ ^^"^ ATG AAA CTT CAC ACT GAG GCC 

Gin Leu Lys Arg Lys Giu Tyr Glu Met Lys Leu His Thr Glu Ala 
3**' 315 320 

CAG ATC CAA GAA GAA GGA ACA GTG GTG GAA TTG ACT GGA A6G CAG TCC 1063 
Gin lie Gin Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser 

330 . 335 340 

^11 r^, i^K^ ^"^^ TCA TTT GTG AAA GTG GAC 

Ser Glu lie Thr Arg Thr lie Thr Lys Leu Ser Phe Val Lys Val Asp 
345 350 ' 

ir-^^ GGG CAG GTG CGC CTA GTA 1159 

Ser His Phe Arg Gin Gly lie Pro Phe Phe Gly Gin Val Arg Leu Val 
360 365 370 

Itl ^ S^f AAA GTC ATA TTC ATC AGA GGA 1207 

Asp Gly Lys Gly Val Pro lie Pro Asn Lys Val lie Phe He Arg Gly 
375 380 385 . 

AAT GAA GCA AAC TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT 1255 
Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu 
390 395 400 

GTA CAG TTC TCT ATC AAC ACC ACC AAC GTT ATG GGT ACC TCT CTT ACT 1303 
yal Gin Phe Ser He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr 
'05 410 415 

GTT AGG GTC AAT TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG 1351 
vai Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp 
425 430 ^ 435 

GTG TCA GAA GAA CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC 1399 
Val Ser Glu Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe " 
440 445 450 
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1495 



1543 



1591 



480 ^ 

z ?s z S5 ^ s r ?« "° 

485 490 ^ ^® Leu He Met 

6CA AAG G6A GGC ATT GTC CGA ACT nnn n^m 

Ala Lys Gly Gly He Val ArJ ?Sr gS ?Sr Hil r?"" ^""^ 

505 ^ ^""-y ^'^^ Gly Leu Leu Val Lys 

515 

CAG GAA GAG ATG AAG GGC CAT TTT Trr a^o 

Gin Giu X JiJ ;2 s S5 s 

z SI? ti: z z ifr s - - SJJ - - s 

560 

565 ™® Gin Ser Leu Pro Ala 

575 580 

12 Jff s s ?s S= S S 0^ 



1879 



feZO 625 

S5 S S= ?S iJ5 - ?s is: 

655 660 

?s Sirs; ^ s:? - ?k si? 

670 675 

Si JSe ?S J^n ser JIs f ^ ^ ^ ^-^^ ^^<' «^ 

Aia pne Thr Asn Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu 

685 690 



1975 



2023 



?071 



2119 
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CAA CAG TAT GAA ATG CAT 6GA CCT GAA GGT CTA CGT GTA 6GT TTT TAT 2167 
Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Ara Val Gly Phe Tyr 
695. 700 ^ 705 

GAG TCA GAT GTA ATG GGA AGA GGC CAT GCA CGC CTG GTG CAT GTT GAA 2215 
Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu Val His Val Glu . 
710 715 720 

GAG CCT CAC ACG GAG ACC GTA CGA AAG TAC TTC CCT GAG ACA TGG ATC 2263 
Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp lie 
725 730 735 740 

TGG GAT TTG GTG GTG GTA AAC TCA GCA GGG GTG GCT GAG GTA GGA GTA 2311 
Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val 
745 750 755 

ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG GCA GGG GCC TTC TGC CTG 2359 
Thr Val Pro Asp Thr lie Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu 
760 765 770 

TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC ACT GCC TCT CTC CGA GOC 2407 
Ser Glu Asp Ala Gly Leu Gly lie Ser Ser Thr Ala Ser Leu Arg Ala 
775 780 785 

TTC CAG CCC TTC TTT GTG GAG CTT ACA ATG CCT TAC TCT GTG ATT CGT 2455 
Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val lie Arg 
790 795 800 

GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC CTA AAC TAC CTT CCC AAA 2503 
Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys 
805 810 815 • 820. 

TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC TCT CCC GCC TTC CTT GCT 2551 
Cys lie Arg Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala 
825 830 835 

GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC TGC ATC TGT GCA AAC GGG 2599 
Val Pro Val Glu Lys Glu Gin Ala Pro His Cys lie Cys Ala Asn Gly 
840 845 850 

CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA AAG TCA TTA GGA AAT GTG 2647 
Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val 
855 860 865 

AAT TfC ACT GTG AGC GCA GAG GCA CTA GAG TCT CAA GAG CTG TGT GGG 2695 
Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly 
870 875 880 

ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA AGG AAA GAC ACA GTC ATC 2743 
Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val He 
885 890 895 900 

AAG CCT CTG TTG GTT GAA CCT GAA GGA CTA GAG AAG GAA ACA ACA TTC 2791 
Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe 
905 910 ' 915 

AAC TCC CTA CTT TGT CCA TCA GGT GGT GAG GTT TCT GAA GAA TTA TCC 2839 
Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser 
920 925 930 
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CTG AAA CTG CCA CCA AAT GTG GTA GAA GAA TOT GCC CGA GCT TCT GTC 
Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val 
935 940 945 

TCA GTT TTG GGA GAC ATA TTA G6C TCT GCC ATG CAA AAC ACA CAA AAT 
Ser Val Leu Gly Asp lie Leu Gly Ser Ala Met Gin Asn Thr Gin Asn 
950 955 ' 960 

CTT CTC CAG ATG CCC TAT GGC TGT GGA GAG CAG AAT ATG GTC CTC TTT 
Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe 
965 970 975 530 

GCT CCT AAC ATC TAT GTA CTG GAT TAT CTA AAT GAA ACA CAG CAG CTT 
Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Glh Leu 
985 990 99s 

ACT CCA GAG GTC AAG TCC AAG GCC ATT GGC TAT CTC AAC ACT GGT TAC 
Thr Pro Glu Val Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr 
1000 1005 1010 

CAG AGA CAG TTG AAC TAC AAA CAC TAT GAT GGC TCC TAC AGC ACC TTT 
Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe 
1015 1020 1025 

GGG GAG CGA TAT GGC AGG AAC CAG GGC AAC ACC TGG CTC ACA GCC TTT 
Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe 
1030 1035 1040 

GTT CTG AAG ACT TTT GCC CAA GCT CGA GCC TAC ATC TTC ATC GAT GAA 
Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu 
'.O^S 1050 1055 1060 

GCA CAC ATT ACC CAA GCC CTC ATA TGG CTC TCC CAG AGG CAG AAG GAC 
Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp 
1065 1070 1075 

AAT GGC TGT TTC AGG AGC TCT GGG TCA CTG CTC AAC AAT GCC ATA AAG 
Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys 
1080 1085 1090 

GGA GGA GTA GAA GAT GAA GTG ACC CTC TCC GCC TAT ATC ACC ATC GCC 
Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala 
1095 1100 1105 

CTT CTG GAG ATT CCT CTC ACA GTC ACT CAC CCT GTT GTC CGC AAT GCC 
Leu Leu Glu He Pro Leu Thr Val Thr HJ.S Pro Val Val Arg Asn Ala 
1110 1115 . 1120 

CTG TTT TGC CTG GAG TCA GCC TGG AAG ACA GCA CAA GAA GGG GAC CAT 
Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His 
1125 1130 1135 1140 

GGC AGC CAT GTA TAT ACC AAA GCA CTG CTG GCC TAT GCT TTT GCC CTG 
Gly Ser His Val Tyr Thr Liys Ala Leu Leu Ala Tyr Ala Phe Ala Leu 
1145 1150 ' 1155 

GCA GGT AAC CAG GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG 
Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu 
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aS vll JJs ^ CGC CCT CAG AAA 3607 

GXu Ala yal Lys Lys Asp Asn Set Val His Trp Glu Arg Pro Gin Lys 

H80 1185 

Pro Jvf SI? '^"^ CAG GCT CCC TCT GCT 3655 

iSo ^ ^^^^i" Gin Ala Pro Ser Ala 

i^^o 1195 1200 

SI? mu 21? i^?^ V''^ TAT CTC ACG GCC CAG 3703 

Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin 

1210 . 1215 1220 

^ fSl Sr« ^ "° GCA ACC AAC ATC GTG AAG 3751 

Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys 
1225 1230 1235 

Irl ??f Jk? «5C 6GT TTC TCC TCC ACC CAG 3799 

Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin 
1240 1245 1250 

f ? ^ ^ AAA TAT GGA GCC GCC ACA 3847 

Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr 
1255 1260 1265 

III °CT GCA CAG GTG ACT ATC CAG TCT TCA GGG 3895 

Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly 
1270 1275 1280 

ACA TTT TCC AGC AAA TTC CAA GTG GAC AAC AAC AAT CGC CTG TTA CTG 3943 
Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu leu 
1285 1290 1295 ijoq 

?^ ^ "^^^ ^'^ CTG CCT GGG GAA TAC AGC ATG AAA GTG 3991 

Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val 
1305 1310 1315 

ACA GGA GAA GGA TGT GTC TAC CTC CAG ACC TCC TTG AAA TAC AAT ATT 4039 
Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He 
1320 1325 1330 

CTC CCA GAA AAG GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG 4087 
Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu 
1335 1340 1345 

CCT CAA ACT TGT GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC 4135 
''^^ Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser 
1350 1355 1360 

CTA AGT GTC AGT TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC 4183 
Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He 
13^5 1370 1375 1380 

GTT GAT GTG AAG ATG GTC TCT GGC TTC ATT CCC CTG AAG CCA ACA GTG 4231 
Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu Lys Pro Thr Val 
1385 1390 1395 

AAA ATG CTT. GAA AGA TCT AAC CAT GTG AGC CGG ACA GAA GTC AGC AGC "4279 
Lys Met Leu Glu Arg" Ser Asn His Val Ser Arg Thr Glu Val Ser Ser 
1400 1405 1410 
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SJI SI? JIS T?^ V"" I'''' CAG ACA CT6 A6C 

«As yal leu He Tyr Leu Asp tys Val Ser Asn Gin Thr Leu Ser 

1420 1425 

TTG TTC TTC AC6 GTT CTG CAA GAT 6TC CCA GTA AGA GAT CTC AAA rcA 
Leu Phe Phe Thr Val Leu Gin Asp Val Pro vll Srg LeS 5^ p2 
•i-^ou 1435 1440 

f?f tT^ vI? 2*' ^''^ ""^^ '^CG GAT GAG TTT GCA ATC " 

Ala He Val Lys Val Tj^ Asp Tyr Tyr Glu Thr Asp Glu Phe Ala He 

1450 1455 i4gQ 

f I'^ ^ ^SA'f CTT GGA AAT GCT TGAAGACCA 

Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 1 

CAAGGCTGAA AAGTGCTTTG CTGGAGTCCT GTTCTCT6AG CTCCACAGAA GACACGTGTT 
TTT6TATCTT TAAAGACTTG ATGAATAAAC ACTTTTTCT6 gTC <5A<»<^<3T«3TT 
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Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu 

IS 10 15 

His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn 

20 25 30 

Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg 

35 40 45 

Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val 

50 55 60 

Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu 
65 70 75 80 

Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr 

85 90 95 

Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys 

100 105 110 

Ser He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met 

115 120 125 

Asp Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr He 

130 135 140 

Gin Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu 
145 150 155 160 

Glu Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe 

165 no 115 

Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr 

180 185 190 

Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val 

195 200 205 

Gin Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn 

210 215 220 

Val Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His 
225 230 235 240 

Val Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His 

245 250 255 

Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn 

260 265 270 

Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu 

275 280 285 

Lys Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin 

290 295 300 

Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He 
305 310 315 320 

Thr Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe 

325 330 335 

Arg Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys 

340 345 350 

Gly Val Pro He Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala 

355 360 365 

Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe 

370 '375 380 

Ser He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val 
385 390 395 400 

Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu 

405 ^410 415 

Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser 

420 425 430 

Lys Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly 

435 440 445 

His Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu 

45a 455 460 

Leu Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly 
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Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro Asn 

950 955 ggjj 

lie Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro Glu 

965 970 g,c 

Val Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg Gin 

980 985 

Leu Asn. Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu Arg 
, 1000 . . . 1005 . 

^^'iA^K ^^"^ '■'P Thr Ala Phe Val Leu Lys 

1015 1020 
Thr Phe Ala Gin Ala Arg Ala Tyr He Phe lie Asp Glu Ala His lie 
°C 1035 1040 

Thr Gin Ala l<eu lie Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly Cys 

« „ 1050 1055 

Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val 
r.-, , , 1065 1070 

Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu Glu 
„ 1080 1085 

,nSn Val Val Arg Asn Ala Leu Phe Cys 

1090 1095 i^j^oO 

Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His 
1110 1115 1120 

Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly Asn 

^. . , 1130 1135 

Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala Val 

1140 1145 1^50 

Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys Pro Lys Ala 
115S 1160 1165 

^",Y2i "^^ "^y^ Gin Ala Pro Ser Ala Glu Val Glu 

1"0 n-;5 1180 

Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro 
It ^ , 1195 1200 

Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp lie Thr 

1205 1210 1215 

Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin Asp Thr Val 
, - , 1220 1225 1230 

Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg 

1235 1240 1245 

Thr Gly Lys Ala Ala Gin V^l Thr He Gin Ser. Ser Gly Thr Phe Ser 

1250 1255 1260 

Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin Val 
265 1270 1275 1230. 

ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu 

« , ^290 1295 

Gly Cys Val 2yE LS& Sin ShK geg Ijga £ya Tar Asn He Leu Pro Glu 
1300 1305 1310 

1330 1335 1340*" — ^^•*— • 

1380 1385 1390 

|j?A^iie:jTyr.;teii::j5£i 

1395 • 1400 ' 1455' 
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S?SS ^^S?'^^ AGGAGGCeCA AACAA6GGSA 6CCCCCA6AG 60 

?a?SS?J? CG^JS^^^i CGCCCCCCAC CCCCCCT^ ill 

TTCGCCCGGG gSgGgSa GAgSgc^Ap r^rTr^K^?^ TCCGAGAT6G GGCTGT6AGC 180 

TGGATTTCGG GGcJgggSS gScCG? " 



Met Leu Thr 
1 



10 15 

aS ?S asp Sa So ^ GCC TGC AGa' 571 

Ala He Asp Ala Pro Lya Thr Cys Ser Pro Lys Gin Phe Ala Cys Arg 

30 35 

Isl ne III ?r ""^ GAC GGT GAG AGG 619 

Asp Gin lie Thr Cys He Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg 

*° 45 50 

Ss Asn gS ^'''^ ^^"^ ^'^^ TGT CCA CAG AGT 667 

ASP Cys Pro Asp Gly Ser Asp Glu Ala Pro Glu He Cys Pro Gin Ser 

53 60 65- 

Sis Sa Gin Ira III rf o"^ '^'^ ^'^^ ^<^<= ^TG GGT ACT GAG 715 
hys Ala Gin Arg Cys Gin Pro Asn Glu His Asn Cys Leu Gly Thr Glu 

70 75 80 

S2 Ivl vll Prl S f CTC TGC AAT GGG GTC CAG GAC TGC ATG 763 
Leu Cys val Pro Met Ser Arg Leu Cys Asn Gly Val Gin Asp Cys Met 

Sd llr rf l""^ ^^^G ^^C CAA GGC AAC TGC 811 

Asp Gly Ser Asp Glu Gly Pro His Cys Arg Glu Leu Gin Gly Asn Cys 

Vll SS^ f^"^ ^'^^ ^'^ CCC ACA CTC GAT GGG CCC 859 

Ser Arg Leu Gly Cys Gin His His Cys Val Pro Thr Leu Asp Gly Pro 

It^ rf c"^"" ^^^^^ CAG GCA GAT GGC AAG AOC 907 

Thr Cys Tyr Cys Asn Ser Ser Phe Gin Leu Gin Ala Asp Gly Lys Thr 
"a 140 145 

Itl ir t"' ™C GGC ACC TGC AGC CAG CTA " 955 

Cys Lys Asp Phe Asp Glu Cys Ser Val Tyr Gly Thr Cys Ser Gin Leu 

155 160 

Th^ ^k'^ t'^^ '^''^ TGT GGC TGT GTT GAA GGA TAG 1003 

Cys Thr Asn Thr Asp Gly Ser Phe He Cys Gly Cys Val Glu Gly Tyr 

I'^O 175 

2S III r^t l"'^ ^ CCC AAG AAC GAG CCA GTA A051 

Leu Leu Gin Pro Asp Asn Arg Ser Cys Lys Ala Lys Asn Glu Pro Val 

185 
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200 ^^'^ J^eu Ala 

210 

?K ?^ s IS ?s s if* -^-^ 

215 • Pro Thr Ser Thr 

. 220 225 

CGG CAG ACC ACA GCC ATG GAC Ttc a«r 

Arg Gin Thr Thr Ala Met So J*'^ TGC 

230 Asn Glu Thr Val Cys 

235 240 

Sp SI? S?s ?II Sy ^2 S! ^ f^?^ CTC AAG TGT GCC 
245 ^ f |5 Gin Gin -Leu Lys Cys Ala 



S5 jj^ S5 ^ ?f 

260 ^ oif ™® Asp Glu His Thr lie Asn lie 

" 270 275 



285 290 

Z Hi III £j So ?r f ^ f " '^^'^ '^'^ ^<^^ 

29S ^ P '^P A*^g lie Phe Val Cys 

300 

Jin J?; jj:; Sp is ;s js^ r 

s «sn biy Asp Thr Cys Val Thr Uu Leu Asp Leu Glu Leu Tyr 

320 

335 

?£r ASP ?JJ X ATC CCA AAG GTG 6AA CGC TGT GAC ATG GAT GGG 
Tjr Asp Tyr Gly Gin Ue Pro Lys Val Glu Arg Cys Asp Met Asp Gly 

-350 

S« ^"^^ ^ <5AC AGC AAG ATT GTG TTT COT CAT GGC 

cm Asn Arg Thr Lys Leu Val Asp Ser Lys lie Vai III Ss Gly 
" 365 

f?f f ^® CGC CTT GTC TAC TGG GCA GAT GCC TAT 

He Thr- Leu Asp Leu Val Ser Arg Leu Val Tyr Jrp aS lyl 

^ 380 



2S Sp S gJ5 ?S S'^ J'^" ^ CAG ACC 

ASP Tyr He Glu Val Val Asp Tyr Glu Gly Lys Gly Arg Gin Thr 

395 400 

ill «f S?^ '^V^ ^ '^'^^^ CAC CTG TAC GGC CTG ACT GTG TTT 

He lie Gin Gly He Leu lie Glu His Leu Tyr Gly Su Jhl 51? 

410 

gJS i^l ?5r S5 itl ^^"^ f ^'^^ <5<^<^ CA<5 CAG 

^lu Asn Tyr Leu Tyr Ala Thr Asn Ser Asp Asn Ala Asn Ala Gin Gin 

430 



1147 



1195 



1243 



1291 



1339 



1387 



1435 



1483 



1531 



1579 



1627 



1675 



1723 



1771 



FIG- 14A 



wo 01/92474 PCT/USOl/18041 

64/91 



?S ser 5S ne Si v^? ^ ^'^^ ^ ^^C GAG TAC CAG 
tys Thr Ser Val lie Arg Val Asn Arg Phe Asn Ser Thr Glu Tyr Gin 

' 445 

5" SS ?S Sa f ^ ^f"^ ^ TAC CAC CAG 

val val Thr Arg Val Asp Lys Gly Gly Ala Leu His lie Tyr His Gin 
455 460 465 

AGG CGT CAG CCC CGA GTG AGG AGC cAt GCC TGT GAA AAC GAC CAG TAT 
Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys gJS JiS Sn lyl 

4''5 4Q0 

G6G AAG CCG GGT GGC TGC TCT GAC ATC TGC CTG CTG GCC AAC AGC CAC 
Gly Lys Pro Gly Gly Cys Ser Asp He Cys Leu Su Sa JJn Jer 5Js 

490 495 

i^'^^^l^^ "^^^ ^<^T CGC TTC AGC CTG GGC AGT GAC 2011 

Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser Leu Gly Ser Asp 
505 ' 5i| 



545 



1819 



1867 



1915 



1963 



^'^^ CAT GAG CTG TTC CTC GTG TAT GGC 2059 
Gly Lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe Leu Val Tyr Gly 
520 525 530 

AAG GGC CGG CCA GGC ATC ATC C6G GGC ATG GAT ATG GGG GCC AAG GTC 2107 
Lys Gly Arg Pro Gly He He Arg Gly Met Asp Met Gly Ala Lys vS 

535 - 



Prf Si« rf St^ rT^ '^TT GAA AAC CTC ATG AAC CCC CGA GCC 2155 

Pro Asp Glu His Met He Pro He Glu Asn Leu Met Asn Pro Arg Ala 
550 555 560 

^ 11^ GAG ACC GGC TTC ATC TAC TTT GCC GAC ACC ACC 2203 

Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe Ala Asp Thr Thr 
3o3 570 575 

AGC TAC CTC ATT GGC CGC CAG AAG ATT GAT GGC ACT GAG CGG GAG ACC 2251 
Ser Tyr Leu lie Gly Arg Gin Lys He Asp Gly Thr Glu Arg Glu Thr 
580 585 590 ' jgg 

ATC CTG AAG GAC GGC ATC CAC AAT GTG GAG GGT GTG GCC GTG GAC TGG 2299 
He Leu Lys Asp Gly He His Asn Val Glu Gly Val Ala Val Asp Trp 
600 605 610 

ATG GGA GAC AAT CTG TAC TGG ACG GAC GAT GGG CCC AAA AAG ACA ATC 2347 
Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro Lys Lys Thr He 

620 625 

cfS S''? ^ CTG GAG AAA GCT GCT CAG ACC CGC AAG ACT TTA ATC 2395 
ser Val Ala Arg Leu Glu Lys Ala Ala Gin Thr Arg Lys Thr Leu He 
630 635 640 

GAG GGC AAA ATG ACA CAC CCC AGG GCT ATT GTG GTG GAT CCA CTC AAT 2443 
Glu Gly Lys Met Thr His Pro Arg Ala He Val Val Asp Pro Leu Asn 
645 650 655 

GGG TGG ATG. TAC TGG ACA GAC TGG GAG GAG GAC CCC AAG GAC AGT CGG 2491 
Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro Lys Asp Ser Arg 
660 665 670 675 
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680 ^f? ^''^ Arg Asp He 

s ?s? IS ^5 s ?s r «f c« 

695 " Ser I^u Asp 

JS s SI ^ ?^ ?s ^" ^ 

710 ^ Phe Tyr Asp Arg Xle 

720 

§js ?s Si ss S5 isi IS ?s sj^ Ips - - - 
s sff ^ rjo^ s SI si! iss s 

" 750 755 

z ?s ?s LI IS S5 s S5 

"^^^^ 785 

^25 830 835 

^ '^^^ TTG GCG AAC CCA TCC TAC GTG CCT CCA 

Asp Ala Asp Gly Val Thr Cys Leu Ala Asn S JJr 

845 

5S IK S |S IS IJS JII - S JSS 5S 

®" 860 865 

Sn gJu So S iSf ^'"'^ ^^'^ '^'^ ^<5C CTG GAC AAC AGT. 
Gin Glu Arg Trp Lys Cys Asp Gly Asp Asn Asp Cys Leu Asp Asn Ser 

" «"5 880 ■ 

siir.sssjsSc^sisssK?s?|5S|s^^ 

910 915 



2539 



2587 



2635 
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2731 
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3019 
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?^ S° JSS SS S; r 5f« 

920 Ala Thr Cys 

IS S5 s° ?s iss r r ^ 

935 «i« ^^'^ Cys Ala Ser Gly Atg 

945 

?ys S pS S sS ?S r '^^^ ^'^^ <^^'^ 

^''P ^ Asp Asp Asp Cys Gly Asp 



985 - j-g 

1005 1010 

IS ^ ?5= as r= ^ «<= 

1015 1020 ^^''loas '^^ 

103? ^ ,n^? Cys Gly Asp Tyr 

1040 

« ?s s= s as s js js^ s 

SI S ^JS IS ^'5 L« S S5 ^ 

'^"^^ 1070 1075 

Zll So ?S Sa S''^ "'^^ ATG GAC TCC 

pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp Cys Met Asp Ser 

1085 1090 

r i^?, f^f f 2?^ ACC CAC GTC TGC GAT CCC AGT 

f Val Thr His Val Cys Asp 
1100 1105 

vll Ss III gS lis iJf f l"^ '^'^ Arc AGC AAA GCG TGG 

vai I.ys Phe Gly Cys Lys Asp Ser Ala Arg Cys lie Ser Lys Ala Trp 

1115 1120 



. — i^,^ ^^Q^ 

ser ASP Glu^Lys Ser Cys Glu Gly Val Thr His vll Ifs AsJ So Kr 



?a? ?5s SI f'' f ^'^^ AAC TCG GAC GAG GAG AAC 

val Cys Asp Gly Asp Asn Asp Cys Glu Asp Asn Ser Asp Glu Glu Asn 

1130 1135 

l^^ AGG CCA CCC TCG CAC CCT TGT GCC AAC AAC 

cys Glu ser I^u Ala Cys Arg Pro Pro Ser His SJ l?s JJS JJS 

1150 1155 



3259 



3307 



3355 



975 

s ?s s S---- - - !SS ?^ 



3451 



3499 



3547 
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3643 
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?hr ^ SI? S^"^ ^ GAT GGC GAC GAC 3979 

Ttir Ser Val Cys Leu Pro Pro Asp Ly3 Leu Cys Asp Gly Asn Asp Asp 

lli??t r?^ ^'^'^ *^G^ ^'^'^ TGC GAC CAG TGC TCT CTG 4027 

cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp Gin Cys Ser Leu 
1175 1180 1185 

AAT AAC GGT GGC TGC AGC CAC AAC TGC TCA GTG GCA CCT GGC GAA GGC 4075 
Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala Pro Gly Glu Gly 

1195 1200 

rll SI? 1°"^ l^^ l^^ GAS CTG 6G6 CCC GAC AAC CAC 4123 

iSi ^ro l^M eiy Met Glu Leu Gly Pro Asp Asn His 

1210 1215 

Jhr ??f 1°^ ^ ^« AAA TGC AGC CAA 4171 

Thr Cys Gin lie Gin Ser Tyr Cys Ala Lys His Leu Lys Cys Ser Gin 
1220 1225 1230 1235 

AAG TGC GAC CAG AAC AAG TTC AGC GTG AAG TGC TCC TGC TAC GAG GGC 4219 
Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser Cys Tyr Glu Gly 
1240 1245 1250 

?S SI? ?r f'^^ '^'^^ CGC AGC CTG GAC CCC TTC 4267 

Trp Val l*u Glu Pro Asp Gly Glu Ser Cys Arg Ser Leu Asp Pro Phe 
1255 1260 1265 

AAG- CCG TTC ATC ATT TTC TCC AAC CGC CAT GAA ATC CGG CGC ATC GAT 4315 
Lys Pro Phe lie lie Phe Ser Asn Arg His Glu lie Arg Arg lie Asp 
1270 1275 1280 

fll Sf^ 9.?^ '^GC GTC CTG GTG CCC GGC CTG CGC AAC ACC 4363 

Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly Leu Arg Asn Thr 
1285 1290 1295 

ATC GCC CTG GAC TTC CAC CTC AGC CAG AGC GCC CTC TAC T6G ACC GAC 4411 

'^^P ^^'^ Gin Ser Ala Leu Tyr Trp Thr Asp 

1300 1305 1310 1315 

GTG GTG GAG GAC AAG ATC TAC CGC GG6 AAG CTG CTG GAC AAC G6A GCC 4459 
Val Val Glu Asp Lys lie Tyr Arg Gly Lys Leu Leu Asp Asn Gly Ala " 
1320 1325 1330 

CTG ACT AGT TTC GAG GTG GTG ATT CAG TAT GGC CTG GCC ACA CCC GAG 4507 
Leu Thr Ser Phe Glu Val Val He Gin Tyr Gly Leu Ala Thr Pro Glu 
1335 1340 1345 

GGC CTG GCT GTA GAC TGG ATT GCA GGC AAC ATC TAC TGG GTG GAG AGT 4555 
Gly Leu Ala Val Asp Trp lie Ala Gly Asn He Tyr Trp Val Glu Ser 
.1350 1355 1360 

AAC CTG GAT CAS ATC GAG GTG GCC AAG CTG GAT GGG ACC CTC CGG ACC 4603 
Asn Leu Asp Gin 116 Glu Val Ala Lys Leu Asp Gly Thr Leu Arg Thr 
1365 1370 1375 ^ 

ACC CTG CTG GCC GGT. GAC ATT GAG CAC CCA AGG GCA ATC GCA CTG GAT 4651 
,I»f Gly Asp He Glu His Pro Arg Ala He Ala Leu Asp 

1380 1385 1395 139I 
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So S2 f'^ 2r GCC AGC CTG CCC 4699 

Arg A3P Gly Ue Leu Phe Trp Thr Asp Trp Asp Ala Ser Leu Pro 

'^"O 1405 1410 

lie Su S S?*^ M^*^ '^^^G CGC CGC ACC GTG CAC 

Arg lie Glu Ala Ala Ser Met Ser Gly Ala Gly Arg Arg Thr Val His 

1420 1425 

So llu ?f 2*^° ^ ACC GTG GAC TAC 4l95 

Arg Glu Thr Gly Ser Gly Gly Trp Pro Asn Gly Leu Thr Val Asp Tyr 
^^^^ 1435 1440 

S!! ^ ?^ ^'^C GCC AGG TCA GAT GCC ATT TAC 4843 

Lea eiu Lys Arg He Leu Trp lie Asp Ala Arg Ser Asp Ala He iyr " 
"•^ 1450 1455 

1'^^ CAC ATG GAG GTG CTT CGG 6GA CAC 4891 

Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val Leu Arg Gly Ss " 
1465 1470 147S 

GAG TTC CTG TCG CAC CCG TTT GCA GTG ACG CTG TAC 666 6G6 GAG GTC 4939 
Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr Gly Gly Glu Val 
"80 14BS 1490 

?Sr Jrn l^^ f!^ ^^'^ ®^ ^^C AAC AAG TGG 4987 

Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys Ala Asn Lys Trp 
1435 1500 1505 

?S «ff u^? P"^? ^"^^ ^ ACC AAC ACC CAG CCC TTT 5035 

Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn Thr Gin Pro Phe 
1510 1515 1520 

a^ S*^? "^^^ "^CC CGC CAG CCC ATG GCT CCC AAT CCC 5083 

Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met Ala Pro Asn S 
1^25 1530 1535 

III CGC CAG GGC CCC TGC TCC CAC CTG TGT CTC ATC 5131 

Cys Glu Ala Asn Gly Gly Gin Gly Pro Cys Ser His Leu Cys Leu He 
1545 1550 1555 

AAC TAC AAC CGG ACC GTG TCC TGC GCC TGC CCC CAC CTC ATG AAG CTC 5179 
Asn Tyr Asn Arg Thr Val Ser Cys Ala Cys Pro His Leu Met Lys Leu 
1560 1565 1570 

CAC AAG GAC AAC ACC ACC TGC TAT GAG TTT AAG AAG TTC CTG CTG TAC 5227 
His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys Phe Leu Leu Tyr 
157S 1580 1585 

GCA CGT CAG ATG GAG ATC CGA GGT GTG GAC CTG GAT GCT CCC TAC TAC 5275 
Arg Gin Met Glu He Arg Gly Val Asp Leu Asp Ala Pro Tyr Tyr 
1590 1595 1600 

AAC TAC ATC ATC TCC TTC ACG GTG CCC GAC ATC GAC AAC GTC ACA GTG 5323 

^^1 A»P lie Asp Asn Val Thr Val 
1^05 1610 1615 

CTA GAC TAC GAT GCC CGC GAG CAG CGT GTG TAC TGG TCT GAC GTG CGG 5371 
Leu Asp Tyr Asp Ala Arg Glu Glti Arg Val Tyr Trp Ser Asp Val Arg 
1620 1625 1630 1635 
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5563 



JSJ 111 S?f T?"" ^"^^ "^'^'C AAC GGC ACA GGC GTG GAG ACA 
Thr Gin Ala He Lys Arg Ala Phe He Asn Giy Thr G? Si cfS 

" 1645 1650 

SIf SI? CCA AAT GGC CAC GG6 CTG GCT GTG GAC TGG 

Val Val Ser Ma Asp Leu Pro Asn Ala His Gly Leu Sa 52 Asp 1% 

1660 1665 . 

GTC TCC CGA AAC CTG TTC TGG ACA AGC TAT GAC ACC AAT AAG AAG C;^G 
val ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp ?£r JJs 

1675 1680 

ill vJ? M ^ >^ P° "^^^ AAG AAC 6CA GTG GTG CAG 

1m2 '^^y ^" As*^ Ala Val Val Gin 

1690 1695 

GGC CTG GAG CAG CCC CAT GGC CTT GTC GTC CAC CCT CTG CGT GGG AAG 5611 
Gly Leu Glu Gin Pro His Gly Leu Val Val His SS Sg S? ^s 
1705 1710 * * •'i7f5 

SS ?Jr J« S ?f ^ '^TG GAT GGC 5659 

Leu Tyr Trp Tht Asp Gly Asp Asn He Ser Met Ala Asn Met Asp Gly 
1720 1725 1,35 

'^'^^ «5GC CAG AAG GGC CCC GTG GGC CTG 5707 
Ser Asn Arg Thr Leu Leu Phe Ser Gly Gin Lys Gly Pro Val Gly Leu 
1735 1740 1745 ' 

xll R« 11"" o*^"" c^"" ^ '^'^S '^^C AGC TCC 6GG AAC 5755 

Ala lie Asp Phe Pro Glu Ser Lys Leu Tyr Trp lie Ser Ser Gly Asn 
1^50 1755 j^^gQ 

Sfl Th^ T?^ ^ ^^"^ ^^"^ GGG CTG GAG GTC ATC 5803 

^icf "* Asp Gly Ser Gly Leu Glu Val lie 

■•■'oS 1770 1775 

fl'^f ACC GCC CTG GCC ATC ATG 5851 

Asp Ala Met Arg Ser Gin Leu Gly Lys Ala Thr Ala Leu Ala He Met 
I'BO 1785 1790 1795 

r?S 2?^ ^'^^ GTG TCG GAA AAG ATG GGC ACA 5899 

Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu Lys Met Gly Thr 
1800 1805 1810 

III cfS f?^ 3!F TCG GGC TCC GTG GTC CTT CGG AAC AGC ACC 5947 
Cys Ser Lys Ala Asp Gly Ser Gly Ser Val Val Leu Arg Asn Ser Thr 
1815 1820 1825 

?l? ^A'f <»C GAG AGC ATC CAG CTG GAC 5995 

Thr Val Het His Met Lys Val Tyr Asp Glu Ser He Gin Leu Asp 
1830 1835 jg^Q 

CAT AAG GGC ACC AAC CCC TGC AGT GTC AAC AAO GGT GAC TGC TCC CAG 6043 

S« Val Asn Asn Gly Asp Cys Ser Gin 
1845 1850 1855 

CTC TGC CTG. CCC ACG TCA GAG ACG ACC CGC TCC TGC ATG TGC ACA GCC 66'91 

iftfin ^"'^ ^^"^ ser Cys Met Cys Thr Ala 

I860 1865 1870 1875 
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GGC TAT AGC CTC CGG AGT GGC CAG CAG GCC TGC GAG GGC GTA GGT TCC 6139 
Giy Tyr ser Leu Arg Ser Gly Gin Gin Ala Cys Glu Gly Val Gly Ser 
1880 1885 1890 

TTT CTC CTG TAC TCT GTG CAT GAG GGA ATC AGG GGA ATT CCC CTG GAT 6187 
Phe Leu Leu Tyr Ser Val His Glu Gly lie A^-g Gly He Pro Leu Asp 
1895 1900 1905 

CCC AAT GAC AAG TCA GAT GCC CTG GTC CCA GTG TCC GGG ACC TCG CTG B235 
Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser Gly Thr Ser Leu 
1910 1915 1920 

GCT GTC GGC ATC GAC TTC CAC GCT GAA AAT GAC ACC ATC TAC TGG GTG 6283 
Ala Val Gly lie Asp Phe His Ala Glu Asn Asp Thr He Tyr Trp Val 
1925 1930 1935 

GAC ATG GGC CTG AGC ACG ATC AGC CGG GCC AAG CGG GAC CAG ACG TGG 6331 
Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg Asp Gin Thr Trp 
1940 1945 19S0 1955 

CGT.GAA GAC GTG GTG ACC AAT GGC ATT GGC CGT GTG GAG GGC ATT GCA 6379 
Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val Glu Gly He Ala 
1960 1965 1970 

GTG GAC TGG ATC GCA GGC AAC ATC TAC TGG ACA GAC CAG GGC TTT GAT 6427 
Val Asp Trp He Ala Gly Asn lie Tyr Trp Thr Asp Gin Gly Phe Asp 
1975 1980 1985 

GTC ATC GAG GTC GCC CGG CTC AAT GGC TCC TTC CGC TAC GTG GTG ATC 6475 
Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg Tyr Val Val He 
, 1990 1995 2000 

TCC CAG GGT CTA GAC AAG CCC CGG GCC ATC ACC GTC CAC CCG GAG AAA 6523 . 
Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val His Pro Glu Lys 
2005 2010 2015 

GGG TAC TTG TTC TGG ACT GAG TGG GGT CAG TAT CCG CGT ATT GAG CGG 6571 . 
Gly Tyr Leu Phe Trp Thr Glu Trp Gly Gin Tyr Pro Arg He Glu Arg 
2020 2025 2030 2035 

TCT CGG CTA GAT GGC ACG GAG CGT GTG GTG CTG GTC AAC GTC AGC ATC 6619 
Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val Asn Val Ser He • 
2040 2045 2050 

AGC TGG CCC AAC GGC ATC TCA GTG GAC TAC CAG GAT GGG AAG CTG TAC 6667 
Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Asp Gly Lys Leu Tyr 
2055 2060 2065 

TGG TGC GAT GCA CGG ACA GAC AAG ATT GAA CGG ATC GAC CTG GAG ACA 6715 
Trp Cys Asp Ala Arg Thr Asp Lys He Glu Arg He Asp Leu Glu Thr 
2070 2075 2080 

GGT GAG AAC CGC GAG GTG GTT CTG TCC AGC AAC AAC ATG GAC ATG TTT 6763 
Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn Met Asp Met Phe 
2085 2090 2095 

TCA GTG TCT GTG TTT GAG GAT TTC ATC TAC TGG AGT GAC AGG ACT CAT 6811 
Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser Asp Arg Thr His 
2100 2105 2U0 2115 
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ITy sS ne r?^ A6C AAA GAC AAT GCC ACA GAC TCC 

Ala Asn Gly Ser lie Lys Arg 61y Ser Lya Asp Asn Ala Thr Asp Ser 

2125 2130 

6TG CCC CTG CGA ACC GGC ATC GGC GTC CAG CTT AAA GAC ATC AAA 6TC 
Val Pro Leu Arg Thr Gly He Gly Val Gin Leu J?? A?p ?Ie Sf 

2140 .. 2145 

TTC AAC CGG GAC CGG CAG AAA GGC ACC AAC GTG TGC GCG GTG GCC AAT 6955 
Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys fll vlt aS i^l 

2155 2160 

«5 l'^ ^AC CGG GGC CGT 66G CAG CGG GCC 

Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Arg Gly Sn A?g 

2170 2175 

?VS Sa III S?J ^ '''^ ^^'^^^ «GA GCA TCG TGC CGC 7051 

Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly Ala Ser Cys Arg 

^*o3 2190 2195 

Su Itl Iff ?r f^*^ S^^ CGC ACC ATT CTC AAG AGT 7099 

Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr He Leu Lys Ser 
2200 2205 2210 

ill Ms Su rf '^'^ ^"^ C<=C <^TG CAG CCC TTC 7147 

lie His ieu Ser Asp Glu Arg Asn Leu Asn Ala Pro Val Gin Pro Phe 
2215 2220 2225 

GAG GAC CCT GAG CAC ATG AAG AAC GTC ATC GCC CTG GCC TTT GAC TAC 7195 
Glu ASP Pro Glu His Met Lys Asn- Val He Ala Leu Sa III Asp lyr 
2230 2235 2240 

aS S?^ r?^ ^ ^ ^C<= CC*= CGC ATC TTC TTC AGC GAC 7243 

^ ^'^'^ Asn Arg Xle Phe Phe Ser Asp 

2245 2250 2255 

?IS SfS ir '^^C AAC GAC GAT GGC TCC AGG AG6 7291 

rie His Phe Gly Asn lie Gin Gin He Asn Asp Asp Gly Ser Arg Arg 
2260 2265 2270 2275 

ill ^ tH vI? f/? ''^ 6AA GGC CTG GCC TAT CAC .7339 

He Thr He Val Glu Asn Val Gly Ser Val Glu Gly Leu Ala Tyr His 
2280 2285 2290 

S« ^ ""^"^ ^ '^^ '■'^C ACG ACA TCC ACC ATC 7387 

Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr Thr Ser Thr He 
2295 2300 2305 

S» ^ f^?!^ S^? CAG ACC CGC CCA GGG GCC TTC GAG CGT GAG 7435 
Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala Phe Glu Arg Glu 
2310 2315 2320 

JS^ Sf rll M^? '^'^ GAC CAC CCA CGG GCC TTC 6TT TTG 7483 
oXo^ ^^'^ »is Pro Arg Ala Phe Val Leu 

2325 2330 2335 

Itl l^^ ^''^ "f^C TGG ACC AAC TGG AAT GAG CAG CAT -7531 

Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp Asn Glu Gin His 
2340 2345 2350 2355 



FIG- 14A 



wo 01/92474 PCT/USOl/18041 

72/91 



CCC AGC ATC ATG CGG GCG GCG CTC TCG GGA GCC AAT 6TC CTG ACC CTT 7579 
Pro Ser He Met Arg Ala Ala Leu Ser Gly Ala Asn Val Leu Thr Leu 
2360 2365 2370 

ATC GAG AAG GAG ATC CGT ACC CCC AAT GGC CTG GCC ATC GAC CAC CGT 7627 
He Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala He Asp His Arg 
2375 2380 2385 

GCC GAG AAG CTC TAC TTC TCT GAC GCC ACC CTG GAC AAG ATC GAG CGG 7675 
Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp Lys He Glu Arg 
2390 2395 2400 

TGC GAG TAT GAC GGC TCC CAC CGC TAT GTG ATC CTA AAG TCA GAG CCT 7723 
Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu Lys Ser Glu Pro 
2405 2410 2415 

GTC CAC CCC TTC GGG CTG GCC GTG TAT GGG GAG CAC ATT TTC TGG ACT 7771 
Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His He Phe Trp Thr 
2420 2425 2430 2435 

GAC TGG GTG CGG CGG GCA GTG CAG CGG GCC AAC AAG CAC GTG GGC AGC 7819 
Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys His Val Gly Ser 
2440 2445 2450 

AAC ATG AAG CTG CTG CGC GTG GAC ATC CCC CAG CAG CCC ATG GGC ATC 7867 
Asn Met Lys Leu Leu Arg Val Asp He Pro Gin Gin Pro Met Gly He 
2455 2460 2465 



ATC GCC GTG GCC AAC GAC ACC AAC AGC TGT GAA CTC TCT CCA TGC CGA 7915 
He Ala Val Ala Ash Asp Thr Asn Ser Cys Glu Leu Ser Pro Cys Arg 
2470' 2475 2480 

ATC AAC AAC GGT GGC TGC CAG GAC CTG TGT CTG CTC ACT CAC CAG GGC 7963 
lie Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu Thr His Gin Gly 
2485 2490 2495 

CAT GTC AAC TGC TCA TGC CGA GGG GGC CGA ATC CTC CAG GAT GAC CTC 8011 
His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu Gin Asp Asp Leu 
2500 2505 2510 2515 

ACC TGC CGA GCG GTG AAT TCC TCT TGC CGA GCA CAA GAT GAG TTT GAG 8059 
Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin Asp Glu Phe Glu 
2520 2525 2530 

TGT GCC AAT GGC GAG TGC ATC AAC TTC AGC CTG ACC TGC GAC GGC GTC 8107 
Cys Ala Asn Gly Glu Cys He Asn Phe Ser Leu Thr Cys Asp Gly Val. 
2535 2540 2545 

CCC CAC TGC AAG GAC AAG TCC GAT GAG AAG CCA TCC TAC TGC AAC TCC 8155 
Pro His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser Tyr Cys Asn Ser 
2550 2555 2560 

CGC CGC TGC AAG AAG ACT TTC CGG CAG TGC AGC AAT GGG CGC TGT GTG 8203 
Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Ser Asn Gly Arg Cys Val 
2565 2570 2575 

TCC AAC ATG CTG TGG TGC AAC GGG GCC GAC GAC TGT GGG GAT GGC TCT 8251. 
Ser Asn Met Leu Trp Cys Asn Gly Ala Asp Asp Cys Gly Asp Gly Ser 
2580 2585 2590 2595 
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Sp nS JiS 5^?" ff^ 

sp i^xu lie Pro Cys Asn Lya Thr Ala Cys Gly Val Gly Glu Phe Arg 

2605 2610 

Ss So aJp gS is ?S rll f ^ «^<^ CAG TTT 8347- 

^ys Arg Asp Gly Thr Cys He Gly Asn Ser Ser Arg Cys Asn 61n Phe 

2620 2625 

GTG GAT TGT GAG GAG GCC TCA GAT GAG AT6 AAC TGC AGT GCC ACC GAC 8395 • 
Val Asp Cy^ Glu Asp Ala Ser Asp Glu Mat Asn Cys Ser Ala Thr Asp 
2630 2635 .2640 

TGC AGC AGC TAG TTC CGC CTG GGC GTG AAG GGC GTG CTC TTC CAG CCC 8443 
Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val LeJ Se Gil Pro 
^o93 2650 2655 

^® 'fAC CCC AGC TGG GTG TGT GAT GGC 8491 

2fifio ^ "^^^ Ser Trp Val Cys Asp Gly 

" 2670 2675 

GCC AAT GAC TGT GGG GAC TAC AGT GAT GAG CGC GAC TGC CCA GGT GTG 8539 
Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp Cys Pro Gly Val 
2680 2 685 2 690 

AAA CGC CCC AGA TGC CCT CTG AAT TAC TTC GCC TGC CCT AGT GGG CGC 8587 
Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys Pro Ser Gly Arg 
2695 2700 2705 

TGC Arc CCC ATG AGC TGG ACG T6T>GAC AAA GAG GAT GAC TGT GAA CAT 8635 
Cys lie Pro Met Ser Trp Thr Cys Asp Lys Glu Asp Asp Cys Glu His 
2710 2715 2720 

?tS u?^ l^*^ "fCA GAG GCC CAG TTT 8683 

^,5r" Cys Asn Lys Phe Cys Ser Glu Ala Gin Phe 

2725 2730 2735 

GAG TGC CAG AAC CAT CGC TGC ATC TCC AAG CAG TGG CTG TGT GAC GGC 8731 . 
Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp Leu Cys Asp Gly 
2740 2745 2750 2755 

AGC GAT GAC TGT GGG GAT GGC TCA GAC GAG GCT GCT CAC TGT GAA GGC 8779 
Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala His Cys Glu Gly 
2760 2765 2770 

^ ACG TGC GGC CCC TCC TCC TTC TCC TGC CCT GGC ACC CAC GTG TGC 8827 
Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly Thr His Val Cys 
2775 . 2780 2785 

GTC CCC GAG CGC TGG CTC TGT GAC GGT GAC AAA GAC TGT GCT GAT GGT 8875 
Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp Cys Ala Asp Gly 
2790 2795 2800 

GCA GAC GAG AGC ATC GCA GCT GGT TGC TTG TAC AAC AGC ACT TGX GAC 8923 

Tyr Asn Ser Thr Cys Asp 
2805 2810 2815 

6AC C6T GAG TTC ATG TGC CAG AAC CGC CAG TGC ATC CCC AAG CAC TTC 8971 - 
Asp Arg Glu Phe Met Cys Gin Asn Arg Gin Cys He Pro Lys His Phe 
2^20 2825 2830 2835 
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2p ^ «j s if r »^ °^ 

2840 ^ ^ Ala Asp Gly Ser Asp Glu Ser Pro 

2845 2850 



2665 



m s IS. - - - - 

2875 2880 

?ys Ss A« Gin ^ gJS Sf ""^^ '^^^ ^G«= '^CC AGC 

2885 ^ 2890 ^^"^ *^''289S "^^ 

P« gJu iJs ?ys J^n T'^ ^GC AGT GGG 

290S 2910 2915 

?S 5S SI Su aS J*^^ «^ GGC CAG GAT GAC TOT GGC 

Hrg cys Val Ala Glu Ala Leu Leu Cya Asn Gly Gin Asp Asp Cys Gly 
2920 2925 2930 

Sp S ser Sn ?f GAG TGT CTC AGC CGC 

Asp ser Ser Asp Glu Arg Gly Cys His lie Asn Glu Cys Leu Ser Arg 

2940 2945 



Ss 2S 5S gK ?S f 2:^^ ^'^^ ^'^^ ^-^^ '^•fC «C 
uys I-eu^Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu Lys He Gly Phe 

2955 , 2960 

iJS ?ys S Ss p2 GGC CGG ACG 

2965 ^ ^ oo^K ^" Asp Gly Arg Thr 

2970 2975 

III SI Isl vlt ?f r?'^ ^^'^ '"^^^ CCC T6C AGC CAG CGC 

29BQ ^ ^ "^^^ ^" '^'^'^ ^^'^ Gys Ser Gin Arg 

2990 2995 

ITs Til ?SJ S?« a?S f ^ J^"" TAT 
cys lie Asn Thr His Gly Ser Tyr Lys Cys Leu Cys Val Glu Gly Tyr 

3°^^ 3005 3010 



9115 
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9355 



9403 



9451 



9499 



9595 



a2 g?S af ^ n""^ o"^ <^^^ ACT GAC GAG 9547 

Ala Pro Arg Gly Qly Asp Pro His Ser Cys Lys Ala Val Thr Asp Glu 

3020 3025 

gJS ITo III III rlt IV" l^"" *^ ^AC CTG CGC AAG CTC AAC 
3S0 ^^"^ ^9 Tyr Tyr Leu Arg Lys Leu Asn 

^ 3035 3040 

S« So gS V''' ""^ «^AG GGC CTG AAC AAC GCC " 9643 

30^5 ^ " '^^J^S ^" Gin Gly Leu Asn Asn Ala 

3050 3055 

S SS S2 SI f l"^ *=A^ ATC TAC TGG ACA GAT - 9691 

3oSo ^ '^rr Arg Glu Gin Met He Tyr Trp Thr Asp 

3065 3070 *^ 3075 
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5S ?S S G?S c^"" t^t ^^'^ ATG CAC CM AAC GGG AGC 
val Thr Thr GAn Gly Ser Met lie Arg Arg Met His Leu Asn Gly Ser 

3085 3090 

52 §fn vS ^''^ ^^"^ CCC GAT GGG CTG 

Asn Val Gin Val Leu His Arg Thr Gly Uu Set Asn Pro Asp Gly Leu 
3095 3100 3105 

Sfl vl? S'? ^ T6G TGC GAC AAA GGC CGG 

Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys Asp Lys Gly Arg 

3115 3120 

rll rll rf f ? V"^ ^ "^"^ ^^CC TAT CGG ACG GTG CTG 

^^lil ^ ^^""Jt^t Ala Tyr Arg Thr Val Leu 

■9x^9 3130 3J35 

51? 5Sr SI ^ ^'^'^ ^^'^^ '^'^'^ <^ CT6 GTG GTG GAT GTG CA6 

Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val Val Asp Val Gin 
3145 3150 3155 

SI ??5 f'*' V^^ GAC CAT TCA CTG ATC GGC 

Asn Gly Tyr leu Tyr Trp Thr Asp Trp Gly Asp His Ser Leu He Gly 
3160 3165 3170 

T^f ^ S''? ^^"^ '■^^ A<^C GTC ATC GTG GAC ACC AAG 

Arg He Gly Met Asp Gly Ser Ser Arg Ser Val He Val Asp Thr Lys 
317S 3180 3185 

rll ThC l*^^ n^^ ^^"^ ^''^ T*^"* GTC ACT GAG CGC ATC 

lie Thr Trp Pro Asn Gly Leu Thr Leu Asp Tyr Val Thr Glu Arg lie 
3190 3195 3200 

TAC TGG GOC GAC GCC CGC GAG GAC TAC ATT GAA TTT GCC AGC CTG GAT 

Arg Glu Asp Tyr He Glu Phe Ala Ser Leu Asp 
3205 3210 3215 

GGC TCC AAT CGC CAC GTT GTG CTG AGC CAG GAC ATC CCG CAC ATC TTT 
•*Mn ^""^ Ser Gin Asp He Pro His He Phe 

^^2*' 3225 3230 3235 

GCA CTG ACC CTG TTT GAG GAC TAC GTC TAC TGG ACC GAC TGG GAA ACA 
Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr Asp Trp Glu Thr 
3240 3245 3250 

AAG TCC ATT AAC CGA GCC CAC AAG ACC ACG GGC ACC AAC AAA ACG CTC 
Lys Ser He Asn Arg Ala His Lys .Thr Thr Gly Thr Asn Lys Thr Leu 
3255 3260 3265 

CTC ATC AGC ACG CTG CAC CGG CCC ATG GAC CTG CAT GTC TTC CAT GCC 
Leu iJ-e Ser Thr Leu His Arg Pro Met Asp Lea His Val Phe His Ala 
3270 3275 3280 

CTG CGC CAG CCA GAC GTG CCC AAT CAC CCC TGC AAG GTC AAC AAT GGT 

%^p? ^" **** Lys Val Asn Asn Gly 

3285 3290 3295 

GGC T6C AGC AAC CTG TGC CTG CTG TCC CCC GGG GGA GGG CAC AAA TGT 

Ser Pro Gly Gly Gly His Lys Cys 
3300 3305 3310 3315 
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?S ?S fJn pnf P° ^^'^ '^'^T «^ ACC TGT GT6 10459 
Ala Cys Pro Tht Asn Phe Tyr Leu Gly Ser Asp Gly Arg Thr Cys Val 
3320 3325 3330 

IS l^^ '^'^ '^C GAC AAG TGC ATC 10507 

Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn Asp Lys Cys He 
3335 3340 33^5 

CCC TTC TGG TGG ARG TGT GAC RCC GAG GAC GAC TGC GGG GAC CAC TCA 105SS 
Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys Gly Asp His Ser 
3350 3355 33gQ 

GAC GAG CCC CCG GAC TGC CCT GAG TTC AAG TGC CG6 CCC GGA CAG TTC 10603 
^'^o P^^o Asp Cys Pro Glu Phe Lys Cys Arg Pro Gly Gin Phe 
3365 3370 33^5 

CAG TGC TCC ACA GGT ATC TGC ACA AAC CCT GCC TTC ATC TGC GAT GGC 10651 
Gin Cys Ser Thr Gly lie Cys Thr Asn Pro Ala Phe He Cys Asp Gly 
3380 3385 3390 ' ''339s 

GAC AAT GAC TGC CAG GAC AAC ACT GAC GAG GCC AAC TGT GAC ATC CAC 10699 
Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn Cys Asp He His • 
3400 3405 3410 

6TC TGC TT6 CCC AGT CAG TTC AAA TGC ACC AAC ACC AAC CGC TGT ATT 10747 
Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr Asn Arg Cys He 
3415 3420 3425 

CCC GGC ATC TTC CGC TGC AAT GGG CAG GAC AAC TGC GGA GAT GGG GAG 10795 
Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys Gly Asp Gly Glu 
3430 3435 3440 • 

GAT GAG AGG GAC TGC CCC GAG GTG ACC TGC GCC CCC AAC CAG TTC CAG 10843 
Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro Asn Gin Phe Gin 
3445 3450 3455 

TGC TCC ATT ACC AAA CGG TGC ATC CCC CGG GTC TGG GTC TGC GAC CGG 10891 
Cys Ser He Thr Lys Arg Cys. He Pro Arg Val Trp Val Cys Asp Arg 
3460 3465 3470 3475 

GAC AAT GAC TGT GTG GAT GGC AGT GAT GAG CCC GCC AAC TGC ACC CAG 10939 
Asp Asn Asp Cys Val Asp Gly Ser Asp Glu Pro Ala Asn Cys Thr Gin 
3480 3485 3490 

ATG ACC TGT GGT GTG GAC GAG TTC CGC TGC AAG GAT TCG GGC CGC TGC 10987 
Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp Ser Gly Arg Cys 
3495 3500 3505 

ATC CCA GCG CGT TGG AAG TGT GAC GGA GAG GAT GAC TGT GGG GAT GGC 11035 
He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly Asp Gly 
3510 3515 3520 

TCG GAT GAG CCC AAG GAA GAG TGT GAT GAA CGC ACC TGT GAG CCA TAC 11083 
Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr Cys Glu Pro Tyr 
3525 3530 3535 

CAG TTC CGC. TGC AAG AAC AAC CGC TGC GTG CCC GGC CGC TGG CAG TGC 11131" 
Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly Arg Trp Gin Cys 
3540 _ 3545 3550 3555 
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■*560 3565. 35^jj 

CCT CGG CCC T6C TCC GAG AST GAG TTC TCC TGT GCC AAC GGC rrr Trr 
Pro Arg Pro^cys Ser Glu Ser Glu^Phe Ser Cyl Sy 2g Jy^ 

3S6S 

ill Sa Sv So ?^ J^'^ f ^'^^ ^^'^C 6CG GAC G6C 

lie Ala Gly Arg Trp. Lys Cys Asp Gly Asp His Asp Cys Ala Asp Gly 

3595 • 3600 

TCG GAC GAG AAA GAC T6C ACC CCC CGC TGT GAC ATG GAC CAG TTC CAG 
ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp nit Sp Sn III Sn 

joio 3615 

TGC AAG AGC GGC CAC TGC ATC CCC CTG CGC TG6 CGC TGT GAC GCA GAC 
cys Lys Ser Gly His Cys lie Pro Leu Arg 14 JJJ Sp t% 

3630 3635 

Sa So ?vs 21? ?f ^ T''' <^««^ ACT GGC GTG 

Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys Gly Thr Gly Val 
3640 3645 355^ 

III So fl?. l^"" 11"" ACC TTG TGC AAG 

Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn Thr Leu Cys Lys 

3660 3665 

CCG CTG GCC T6G AAG TGC GAT GGC GAG GAT GAC TGT GGG GAC AAC TCA 
Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly Lp ^ S« 
3670 3675 368O 

GAT GAG AAC CCC GAG GAG TGT GCC CGG TTC GTG TGC CCT CCC AAC CGG 
Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe Val Cys Pro Pro Jsn A^g 
3685 3690 3695 ^ 

pk*^ ^'^ ^■'^ CTG TGG ATC GGG CGC CAA 

Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp He Gly Arg Gin 
■*^00 3705 3710 '3715 

V""^ S?^ ^'^^ ^^'^ ^^"^ ACT GAT GAA GAG GAC TGT 
Cys Asp Gly Thr Asp Asn Cys Gly Asp Gly Thr Asp Glu Glu Asp Cys 
3720 3725 3730 

GAG CCC CCC ACA GCC CAC ACC ACC CAC TGC AAA GAC AAG AAG GAG TTT 
Glu Pro Pro Thr Ala His Thr Thr His Cys Lys Asp Lys Lys Glu Phe 
3735 3740 3745 

Su ^ ?^ ^ ^CC TCC TCC CTG CGC TGC AAC ATG 

Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu Arg Cys Asn Met 
3750 3755 3750 

TTC GAT GAC TGC GGG GAC GGC TCT GAC GAG GAG GAC TGC AGC ATC GAC 
Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp Cys Ser lie Asp 
37 65 3770 3^^^ - . 

CCC AAG CTG ACC AGC TGC GCC ACC AAT GCC AGC ATC TGT GGG GAC GAG 
Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser He Cys Gly Asp Glu 
3780 3785 3790 ' 3795 
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S5 51? A?S ^ ^ ^'^^ TGC CGC TCG 

Aia Atg cys VaX Arg Thr 61u Lys Ala Ala Tyr Cys Ala Cys Arg Ser 

•"'"^ 380S ' 3810 

S5 Ss ?S SI? S G?'' ^^'^ <^*« 

wy Phe His Thr Val Pro Gly Gin Pro 61y Cys Gin Asp lie Asn Glu 

^ 3820 3825 . 

?ys 2S So G?v r^'' CAG CTC TGC AAC AAC AOC AAG GGC 
Cys leu Arg Phe Gly Thr Cys Ser Gin Leu Cys Asn Asn Thr Lys Gly 
3830 3835 3g^Q ' ' 

Hil Leu Ifs t« Jvs J?I f ^ 2!^ 

3845 38S0 "^'^ ^'^^ 

3855 

TGC AAG GCC 6AA GGC TCT GAG TAG CAG GTC CT6 TAC ATC GCT GAT car 
cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu ill SI Z Z 
3865 3870 3875 

JJI gJS Jff f oJ'' ^ ^'^^ TCG GCT TAC GAG 

Asn Glu lie Arg Ser Leu Phe Pro Gly His Pro His Ser Ala Tyr Glu 

^8^° 3885 .. 3890 

Sn III r^n r?"^ ^^"^ ''^^ ^TT GAT GCT ATG GAT GTC 

Gin Ala Phe Gin Gly Asp Glu Ser Val Arg He Asp Ala Met Asp Val 
•*»S5 3900 3905 

ill SI? it? S5I ??S v^? '^'5G CAC ACG .GGC ACC 

His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp His Thr Gly Thr 
3910 3915 3920 

JIS f c"^^ f "^"^ S^'' ^^■^ CCT ACC ACT TCC AAC 

iltl '^^^ Thr Thr ser Asn 

3925 3930 3935 

Arc Hi^ Jr^ li^ rf ^''^ ^^'^ '^"^ CTC AAC ATT TCA 

Arg His Arg Arg Gin lie Asp Arg Gly Val Thr His Leu Asn lie Ser 
3940 3945 3950 3955 

GGG CTG AAG ATG CCC AGA GGC ATC GCC ATC GAC TGG GTG GCC GGA AAC 
Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp Val Ala Gly Asn 
39fi0 3965 3970 

GTG TAC TGG ACC GAC TCG GGC CGA GAT GTG ATT GAG GTG GCG CAG ATG 
Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu Val Ala Gin Met 
3975 3980 3935 

ils i?S til S*^ f ^ ^® ?^ ^ ATG ATT GAC GAG CCC 

Lys Gly Glu Asn Arg Lya Thr Leu He Ser Gly Met He Asp Glu Pro 
3990 3995 4000 

S^? S^? ^ ATG TAC TGG TCA GAC 

AAAc ^ A*^9 Thr Met Tyr Trp Ser Asp 

4010 4015 

III '^T^S <S'^T GGG ACG CTT 

4S0 ^ ^ " ^".iXf ^^'^ ^^"^ Asp Gly Thr Leu 

^020 4025 4030 4035 
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rt» ^J? S.^^ ^'^T 'f^^ CCC ACA GGC CTG GCC 12619 

Arg Giu Thr Leu Val Gin Asp Asn lie Gin Trp Pro Thr Gly Leu Ala 
4040 404S 4050 

tly ^AC TGG GCA GAC GCC AAG CTT TCA 12667 

Val Asp. Tyr His Asn Glu Arg Leu Tyr Trp Ala Aso Ala Lys Leu Set 
4055 4060 . • * 4065 

GTC ATC GGC AGC ATC CGG CTC AAT GGC ACG GAC CCC ATT GTG GOT GOT 12715 
Val lie Gly Ser lie Arg Leu Asn Gly Thr Asp Pro lie Val Ala Ala 
4070 4075 4080 

GAC AGC AAA CGA GGC CTA AGT CAC CCC TTC AGC ATC GAC GTC TTT GAG* ^- •*12763 

Pl^e Ser lie Asp Val Phe Glu 
4085 4090 4095 

GAT TAC ATC TAT GGT GTC ACC TAC ATC AAT AAT CGT GTC TTC AAG ATC 12611 
Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg Val Phe Lys He 
4100 4105 4110 4115 

CAT AAG TTT GGC CAC AGC CCC TTG GTC AAC CTG ACA GGG GGC CTG AGC 12859 
His Lys Phe Gly His Ser Pro Leu Val Asn Leu Thr Gly Gly Leu Ser 
4120 4125 4130 

CAC GCC TCT GAC GTG GTC CTT TAC CAT CAG CAC AAG CAG CCC GAA GTG 12907 
His Ala Ser Asp Val Val Leu Tyr His Gin His Lys Gin Pro Glu Val 
4135 4140 4145 

ACC AAC CCA TGT GAC CGC AAG AAA TGC GAG TGG CXC TGC CTG CTG AGC 12955 
Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu Cys Leu Leu Ser 
4150 4155 4160 

CCC AGT GGG CCT GTC TGC ACC TGT CCC AAT GGG AAG CGG CTG GAC AAC 13003 
Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lvs Arg Leu Asp Asn 
4165 4170 4175 

GGC ACA TGC GTG CCT GTG CCC TCT CCA ACG CCC CCC CCA GAT GCT CCC 13051 
Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro Pro Asp Ala Pro 
4180 4185 4190 4195 

CGG CCT GGA ACC TGT AAC CTG CAG TGC TTC AAC GGT GGC AGC TGT TTC 13099 
Arg Pro Gly Thr Cys Asn Leu Gin Cys Phe Asn Gly Gly Ser Cys Phe ' 
4200 4205 4210 

CTC AAT GCA CGG AGG CAG CCC AAG TGC CGC TGC CAA CCC CGC TAC ACG 13147 
Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin Pro Arg Tyr Thr 
4215 4220 4225 

GGT GAC AAG TGT GAA CTG GAC CAG TGC TGG GAG CAC TGT CGC AAT GGG 13195 
Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu His Cys Arg Asn Gly 
4230 4235 4240 

GGC ACC TGT GCT GCC TCC CCC TCT GGC ATG CCC ACG TGC CGG TGC CCC 13243 
Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr Cys Arg Cys Pro 
4245 4250 4255 

ACG GGC TTg ACG GGC CCC AAA TGC ACC CAG CAG GTG TGT GCG GGC TAC 13291 
Thr Gly Phe Thr Gly Pro Lys Cys Thr Gin Gin Val Cys Ala Gly Tyr 
4260 4265 4270 4275 
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Itl aS J^S ^ J^'^ ACT GK: AAC CAG 6GC AAC CAG CCC CAG 

Cys Ala Asn Asn Ser Thr Cys Thr Val Asn Gin Gly Asn Glti Pro Gin 
4280 428S 4290 

S'^ f o?^ ^ GAC CGC T6C CAG TAC CGG CAG 

Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys Gin Tyr Arg Gin 
429S 4300 4305 

TGC TCT GGC TAC TGT GAG AAC TTT GGC ACA TGC CAG ATG GCT GCT GAT 
Cys Ser Gly Tyr Cys Glu Asn Phe Gly Thr Cys Gin Met Ala Ala Asp 
«310 4315 4320 

660 TCC CGA CAA TGC CGC TGC ACT GCC TAC TTT GAG 6GA TCG AGG TGT 

^Ati ^^"^ ^""^ "^^'^ ^^"^ fy'f P*»e GIu Cly Ser Arg Cys 

4330 4335 

GAG GTG AAC AAG TGC AGC CGC TGT CTC GAA G6G GCC TGT GTG GTC AAC 
Glu Val Asn Lys Cys Ser Arg Cys Leu Glu Gly Ala Cys Val Val Asn 

4345 4350 4355 

AAG CAG AGT GGG GAT GTC ACC TGC AAC TGC ACS GAT GGC CGG GTG GCC 
Lys Gin Ser Gly Asp Val Thr Cys Asn Cys Thr Asp Gly Arg Val Ala 
4360 43^5 4370 

CCC AGC TGT CT6 ACC TGC GTC GGC CAC TGC AGC AAT GGC GGC TCC TGT 
Pro Ser Cys Leu Thr Cys Val Gly His Cys Ser Asn Gly Gly Ser Cys 
4375 4380 4335 

ACC ATG AAC AGC AAA ATG ATG CCT GAG TGC CAG TGC CCA CCC CAC ATG 
Thr wet Asn Ser Lys Met Met Pro Glu Cys Gin Cys Pro Pro His Met 
4390 4395 44QQ 

ACA GGG CCC CGG TGT GAG GAG CAC GTC TTC AGC CAG CAG CAG CCA GGA 

''^^ Val Phe Ser Gin Gin Gin Pro Gly 

4405 4410 . 4415 

CAT ATA GCC TCC ATC CTA ATC CCT CTG CTG TTG CTG CTG CTG CTG GTT 
.!rif "® ^'^^ I-e" l-eu Leu Leu Leu Leu Leu Val 

4420 4425 4430 4435 

CTG GTG GCC GGA GTG GTA TTC TGG TAT AAG CGG CGA GTC CAA GGG GCT 
Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg Val Gin Gly Ala 
4440 4445 4450 

AAG GGC TTC CAG CAC CAA CGG ATG ACC AAC GGG GCC ATG AAC GTG GAG 
Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala Met Asn Val Glu 
4455 4460 4466 

ATT GGA AAC CCC ACC TAC AAG ATG TAC GAA GGC GGA GAG CCT GAT GAT 
zie Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly Glu Pro Asp Asp 
4470 4475 '4480 

GTG GGA GGC CTA CTG 6AC GCT GAC TTT GCC CTG GAC CCT GAC AAG CCC 
val Gly Gly Leu Leu Asp Ala Asp Phe Aid Leu Asp Pro Asp Lys Pro 
4485 4490 ' 



4495 



ACC AAC TTC ACC AAC CCC GTG TAT GCC ACA CTC TAC ATG GGG GGC CAT 

Ala Thr Leu Tyr Met Gly Gly His 
4500 4505 4510 4515 



13339 



13387 



13435 



13483 



13531 



13579 



13627 



13675 



13723 



13771 



13819 



13867 



13915 



13963 



14011 



FIG. 14A 



wo 01/92474 PCT/USOl/18041 

81/91 



III stl S« f ^ r^"^ n?"" ''^^ ^'^^ ^^'^ CTC CTG 

Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys Arg Glu Leu Leu 

4520 4525 . 4530 

GGC CGG GGC CCT GAG GAC GAG ATA 6GG GAC CCC TTG GCA TAGGGCCCTG CC 
CCGTCGGACT GCCCCCAGAA AGCCTCCTGC CCCCTGCCGG iG^GTCCTf ^S^^ 
Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu Ala 
4535* 4540 
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CTCCCCAGCC 
CATTTTATAT 
CCTGCCTGCT 
GCTGCACCTC 
TCCCCTCCCT 
CCGCTCCCAC 
CTGGAAGACG 
6GAGGCCACC 
CCCCCCTCCC 
AGACTTCCTC 
GCCACACCCA 
TTTTGCTGAA 
AAAAAA 



AGCCCTTCCC 
GTGAGCGAGC 
CXrTTGGCACC 
CTACCCTCCC 
GTATAAGACA 
AGCTTCCTGA 
TGGCTCTGGG 
CCAAACCCCA 
AGACAGGOCC 
TGGTAAACAT 
GGAAGGGAAA 
TTCTTTACAA 



TG6CCCCGCC 
AAGCCGGCAA 
CCCATGCTGC 
ACCAGAACGC 
CTTTGCCAAG 
GGGCTAATTC 
TGAGGTAGGC 
GCCCCAACTC 
TCCCTGTCTC 
TCCTCCAGCC 
GCGGGCAGCC 
CTAAATAACA 



GGATGTATAA 
GCGAGCACAG 
CTTCAGGGAG 
ACCCCACTGG 
GCTCTCCCCt 
TGGGAAGGGA 
GGGAAAGGAT 
CAGGGGCACC 
CAGGGCCCCC 
TCXCCTCCCC 
CCGTTTTGGG 
CACATATTCT 



ATGTAAAAAT 
TATTATTTCT 
ACAGGCAGGG 
GAGAGCTGGT 
CTCGCCCCAT 
GAGTTCTTTG 
GGAGTGTTTT 
TATGAGATGG 
ACCGAGGTTC 
TGG6GACGCC 
GACGTGAACG 
TATAAATAAA 



GAAGGAATTA 
CCATCCCCTC 
AG6GCXTGGG 
GGTGCAGCCT 
CCCTGCTTGC 
CTGCCCCTGT 
AGTTCTTGGG 
CCATGCTCAA 
CCAGGGCTGG 
AAGGAGGTGG 
TTTTAATAAT 
ATT6TAAAAA 
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mL Aeu Thr p^o Pz: o Leu Leu T..„ r .,. r^u Pr. r,.n r^.u j^. . . 

^ 10 IS 
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His Gin Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu Asn 

Asp Gin Tyr Gly Lys Pro Gly Gly Cys Ser Jsp lie Cys Leu Leu HI 

490 495 
Asn ser His Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser I*u 

505 

Gly Ser Asp Gly Lys Ser Cys Lys Lys Pro Giii His Glu Leu Phe Leu 

. 520 525 
Val Tyr Gly Lys Gly Arg Pro Gly He He Aig Gly Met Asp Met Gly 

530 535 g^Q 

Ala Lys Val Pro Asp Glu His Met He Pro lie Glu Asn Leu Met Asn 
« , 555 560 

Pro Arg Ala Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe Ala 

565 570 575 

ASP Thr Thr Ser Tyr Leu He Gly Arg Gin Lys He Asp Gly Thr Glu 

580 585 
Arg Glu Thr He Leu Lys Asp Gly He His Asn Val Glu Gly Val Ala 

595 600 605 

Val Asp Trp Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro Lys 

61S 620 
Lys Thr He Ser Val Ala Arg Leu Glu Lys Ala i^a Gin Thr Arg Lys 
• 630 635 640 

Thr Leu He Glu Gly Lys Met Thr His Pro Arg Ala He Val Val Asp 

645 650 655 

Pro Leu Asn Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro Lys 

660 665 670 

Asp Ser Arg Arg Gly Arg Leu Glu Arg Ala Trp Met Asp Gly Ser His 

675 680 685 

Arg Asp He Phe Val Thr Ser Lys Thr Val Leu Trp Pro Asn Gly Leu 

690 695 700- 

Ser Leu Asp He Pro Ala Gly Arg Leu Tyr Trp Val Asp Ala Phe Tyr 
705 710 715 720 

Asp Arg He Glu Thr He Leu Leu Asn Gly Thr Asp Arg Lys He Val 

'25 730 735 

Tyr Glu Gly Pro Glu Leu Asn His Ala Phe Gly Leu Cys His His Gly 

740 745 750 

Asn Tyr Leu Phe Trp Thr Glu Tyr Arg Ser Gly Ser Val Tyr Arg Leu 

755 760 765 

Glu Arg Gly Val Gly Gly Ala Pro Pro Thr Val Thr Leu Leu Arg Ser 

770 775 780 

Glu Arg Pro Pro He Phe Glu He Arg Met Tyr Asp Ala Gin Gin Gin 
7f5 790 795 800 

Gin Val Gly Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser Ser 

805 810 815 

Leu Cys Leu Ala Thr Pro Gly Ser Arg Gin Cys Ala Cys Ala Glu Asp 

820 825 830 

Gin Val Leu Asp Ala Asp Gly Val Thr Cys Leu Ala Asn Pro Ser Tyr 

835 840 845 
Val Pro Pro Pro ^r^— — ~— 
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Ih£ £2ds Ser fOa ^ Thr] Cys Pro P/o '^Asn Gin Phe Ser Cys'^Ha 
Ser Gly Arg . Cys Tie :Pro IIe^Ser:.>rrp^^^^^^^ AipW 
Cys Gly Asd /Ara.Sert^o /r?r,:l5$^^;iAr^ n--. -.wl^^S-T^i; 




Asp Pro Ser^Val Lys' Phe Gly Cysj-Lysl^Asp 'Ser-^Ala'^.^^ Xie'^er 

105 1110 ^^i^ .^' \'^'^iiis^^v ••iWo 

Lys Ala Trp Val Cys Asp Gly Asp- Asn Asp Cys Glu Asp Asn Ser-;ASp 
1125 > 1130., • 1135'.^1J 

Glu Glu Asn Cys Glu Ser Leu Ala..C^ Arg Pro Pro Ser His .Prd^GvS 

' 1.140 1.145. ' 1150 

Ala Asn Ma Ihr Ser Val Cys Leu ££0 ■ Pro Aso i^s Leu £^ Asofplv . 

1-155 : 1160 • • - 1165 

Mn Aso M£ £y£ £ly hSS, SXx §^ hSB, Sh, gly glu gys As^' Gin 

1170 1175 IrtO 

Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala Pro 
1S5 1190 1195 1200 

Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly Pro 

1205 1210 1215 

Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu Lys 

1220 1225 1230 

Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser Cys 

1235 1240 1245 

Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Ser Cys Arg Ser Leu 

1250 1255 1260 

Asp Pro Phe Lys Pro Phe He He Phe Ser Asn Arg His Glu He Arg 
265 1270 1275 1280 

Arg He Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly Leu 

1285 1290 1295 

Arg Asn Thr He Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu Tyr 

1300 1305 1310 

Trp Thr Asp Val Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu Asp 

1315 1320 1325 

Asn Gly Ala Leu Thr Ser Phe Glu Val Val He Gin Tyr Gly Leu Ala 

1330 1335 1340 

Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr Trp 
345 1350 1355 1360 

Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly Thr 

1365 1370 1375 

Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala He 
1380 1385 1390 
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Ala Leu Asp Pro Arg Asp Gly He Leu Phe Trp Thr Asp Trp Asp Ala 

- r 1400 1405 

Ser Leu Pro Arg He Glu Ala Ala Ser Met Ser Gly Ala Gly Arg Arg 

1415 1420 
Thr Val His Arg Glu Thr Gly Ser Gly Gly Trp Pro Asn Gly Leu Thr 
, * 1435 1440 

Val Asp Tyr Leu Glu Lys Arg He Leu Trp He Asp Ala Arg Ser Asp 

„ 1445 1450 1455 

Ala He Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val Leu 

1460 1465 1470 

Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr Gly 
1475 • 1480 1485 

*^^^1490 '**'l49S '"^^ "^^"^ ''^^ 

Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn Thr 

^? » «u 1515 1520 

Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met Ala 
„ « 1525 1530 1535 

Pro Asn Pro Cys Glu Ala Asn Gly Gly Gin Gly Pro Cys Ser His Leu 
« , 1540 1545 1550 

Cys Leu He Asn Tyr Asn Arg Thr Val Ser Cys Ala Cys Pro His Leu 
1555 1560 1565 

''^^^ Tyr Glu Phe Lys Lys Phe 
1570 1575 1580 

Leu Leu Tyr Ala Arg Gin Met Glu He Arg Gly Val Asp Leu Asp Ala 
585 1590 1595 igoo 

Pro Tyr Tyr Asn Tyr He He Ser Phe Thr Val Pro Asp He Asp Asn 
, , 1^05 1610 1615 

Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp Ser 

1620 1625 1630 

Asp Val Arg Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr Gly 

1635 1640 1645 

Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu Ala 

1650 1655 1660 

Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr Asn 
, 1670 1675 1680 

Lys Lys Gin He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn Ala 

1685 16.90 1695 ' 

Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro Leu 

1700 1705 1710 

Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala Asn 

1715 1720 1725 

Met Asp Gly Ser Asn Arg Thr Leu Leu Phe Ser Gly Gin Lys Gly Pro 

1730 1735 1740 

Val Gly Leu Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp He Ser 
1750 . 1755 . 1760 

Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Gly Leu 

„ , "65 1770 1775 

GXu Val He Asp Ala Met Arg Ser Gin Leu Gly Lys Ala Thr Ala Leu 

1780 1785 1790 

Ala He Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu Lys 

1795 1800 .. 1805 

Met Gly Thr Cys Ser Lys Ala Asp Gly Ser Gly Ser Val Val Leu Arg 

1810 1815 . 1820 

Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser lie 
825 1830 1835 1840 

Gin Leu Asp His Lys Gly Thr Asn Pro Cys Ser Val Asn Asn Gly Asp • 

1845 • 1850 1855 

Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys Met 
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"60 1865 1870 

Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu Gly 
18/5 1880 10Q5 

^^^noiX ^^"^ "^y"" «i« Glu Gly He Arg Gly lie 

1890 1895 1900 

Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser Gly 

£2 « ^^^^ 1915 1920 

Thr Set Leu Ala Val Gly He Asp. Phe His Ala Glu Asn Asp Thr He 

1525 1930 *^iQ35 

Tyr Trp Val Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg Asp 

1940 1945 1950 

Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val Glu 
1955 1960 1965 

Int '^^^ '^^^ Trp Thr Asp Gin 

1970 1975 1980 

Gly Phe Asp Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg Tyr 
985 1990 1995 2000 

Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val His 

2005 2010 2015 

Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly Gin Tyr Pro Arg 

2020 2025 2030 

He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val Asn 

2035 2040 2045 

Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Asp Gly 

2050 2055 2060 

Lys Leu Tyr Trp Cys Asp Ala Arg Thr Asp Lys He Glu Arg He Asp 
065 2070 2075 2080 

Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn Met 

2085 2090 2095 

Asii Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser Asp 

2100 2105 2110 

Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Ser Lys Asp Asn Ala 

2115 2120 2125 

Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys Asp 

2130 2135 2140 

He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys Ala 
145 2150 2155 2160 

Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Arg Gly 

2165 2170 2175 

Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly Ala 

2180 2185 2190 

Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr He 

2195 2200 2205 

Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro Val 

2210 2215 2220 

Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu Ala 
225 2230 2235 2240 

Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He Phe 

^ 2245 2250 2255 

Phe Ser Asp He His Phe Gly Asn He Gin Gin He Asn Asp Asp Gly 

2260 2265 2270 

Ser Arg Arg He Thr He Val gIu Asn Val Gly Ser Val Glu Gly Leu 

2275 2280 2285 

Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr Thr 

2290 2295 2300 

Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala Phe 

2310 2315 2320 

Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg Ala 
2325 2330 2335 
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Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp Asn 

2340 2345 2350 

Glu Gin His Pro Ser lie Met Arg Ala Ala Leu Ser Gly Ala Asn Val 

2355 2360 2365 

Leu Thr Leu He Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala He 

2370 2375 2380 

Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp Lys 
385 2390 2395 2400 

He Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu Lys 

2405 2410 2415 

Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His He 

2420. 2425 2430 

Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys His 

2435 2440 2445 

^^^-.^i^ Ser Asn Met Lys Leu Leu Arg Val Asp He Pro Gin Gin Pro 

2450 2455 2460 

Met Gly He He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu Ser 
"5 2470 2475 2480 

Pro Cys Arg He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu Thr 

2485 2490 2495 

His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu Gin 

2500 2505 2510 

Asp Asp Leu Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin Asp 

2515 2520 2525 

Glu Phe Glu Cys Ala Asn Gly Glu Cys He Asn Phe Ser Leu Thr Cys 

2530 2535 2540 

Asp Gly Val Pro His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser Tyr 
545 2550 2555 2560 

Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Ser Asn Gly 

2565 2570 2575 

Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Ala Asp Asp Cys Gly 

2580 2585 2590 

Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val Gly 

^ 2595 .2600 2605 

Glu Phe Arg Cys Arg Asp Gly Thr Cys He Gly Asn Ser Ser Arg Cys 

2610 2615 2620 

Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys Ser 
^25 2630 2635 2640 

Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val Leu 

^ 2645 2650 2655 

Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp Val 

2660 2665 2670 

Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp Cys 

2675 2680 2685 ^ ^ 

Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys Pro 

2690 2695 2700 

Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp Asp 
705 2710 2715 2720 

Cys Glu His Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser Glu 

2725 2730 2735 

Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp Leu 

2740 2745 2750 

Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala Ris 

2755 2760 . 2765 

Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly Thr 

2770 2775 2780 

His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp Cys 
785 . 2790 2795 2800 

Ala Asp Gly Ala Asp Glu Ser He Ala Ala Gly Cys Leu Tyr Asn Ser 
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^. ^ ^ 2805 2810 2815 

Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Gin Cys He Pro 

2820 2825 ♦ 2830 

Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser Asp 
2835 2840 2845 

^ o!!n ^^"^ "^y^ Cys Gly Pro Ser Glu Phe Arg 

2850 2855 .2860 

Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Glh Trp Glu Cys Asp Gly 
5" 2870 2875 2880 

Glu Asn Asp Cys His Asp Gin Ser Asp Glu Ala Pro Lys Asn Pro His 

2885 2890 2895 

Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu Cys 

2900 2905 2910 

Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin Asp 
2915 2920 2925 

^9 Gly Cys His He Asn Glu Cys 
2930 2935 2940 

Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu Lys 

.1 ou , ^^^^ "55 2960 

He Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp Asp 

2965 2970 2975 

Gly Arg Thr Cys Ala Asp Val Asp Glu Cys Ser Thr Thr Phe Pro Cys 

2980 2985 2990 

Ser Gin Arg Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys Val 

2995 3000 3005 ' 

Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala Val 

3010 3015 3020 

Thr Asp Glu Glu Pro Phe Leu He Phe Ala Asn Arg Tyr Tyr Leu Arg 
025 3030 3035 3040 

Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly Leu 

3045 3050 3055 

Asn Asn Ala Val Ala Leu Asp Phe Asp Tyr Arg Glu Gin Met He Tyr 

3060 3065 3070 

Trp Thr Asp Val Thr Thr Gin Gly Ser Met He Arg Arg Met His Leu 

3075 3080 3085 

Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn Pro 

3090 3095 3100 

Asp Gly Leu Ala Val Asp Trp Val Gly . Gly Asn Leu Tyr Trp Cys Asp 
105 3110 3115 3120 

Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr Arg 

3125 3130 3135 

Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val Val 

3140 3145 3150 

Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His Ser 

3155 3160 3165 

Leu He Gly Arg He Gly Met Asp Gly .Ser Ser Arg Ser Val He Val 

3170 3175 3180 

Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Leu Asp Tyr Val Thr 
IfiS 3190 3195 3200 

Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe Ala 

3205 3210 3215 

Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He Pro 

3220 3225 3230 

His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr Asp 

3235 3240 3245 

Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Thr Asn 

3250 3255 3260 

Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His Val 
265 3270 3275 3280 
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Phe Hxs Hxa i.«u «iy v»xn Pro Asp Val Pro Asn His Pro Cys Lys Val 

* * ^2®^ 3290 3295 

Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly Gly 

3300 3305 3310 

His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Ser Asp Gly Arg 

3315 3320 3325 

^^^-ao^n '^^^ Cys Lys Asn Asp 

3330 3335 3340 

Lys Cys lie Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys Gly 

3350 3355 *' ^ 33gJ. 

Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg Pro 

3365 3370 3375 

Gly Gin Phe Gin Cys Ser thr Gly He Cys Thr Asn Pro Ala Phe He 

3380 3385 3390 

Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn Cys. 

3395 3400 3405 

Asp He His Val Cys . Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr Asn 

3410 3415 3420 

Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys Gly 
«5 3430 3435 3440 

Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro Asn 

«w ^, ^^^^ 3455 

Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp Val 

3460 3465 3470 

Cys Asp Arg Asp Asn Asp Cys Val Asp Gly Ser Asp Glu Pro Ala Asn 

3475 3480 3485 

Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp Ser 

3490 3495 3500 

Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp Cys 
505 3510 3515 3520 

Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr Cys 

3525 3530 3535 

Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly Arg 

3540 3545 3550 

Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu Glu 

3555 3560 3555 

Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Ser Cys Ala Asn 

3570 3575 3580 

Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp Cys 
585 3590 3595 3600 

Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met Asp 

3605 3610 3615 

Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Arg Cys 

3620 3625 3630 

Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys Gly 

3635 3640 3645 

Thr Gly Val Arg Thr Cys Pi^o Leu Asp Glu Phe Gin Cys Asn Asn .Thr 

3650 3655 . 3660 

Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly 
665 3670 3675 3680 

Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe Val Cys Pro 

3685 3690 3695 

Pro Asn Arg Pco Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp He 

3700 3705 3710 

Gly Arg Gin Cys Asp Gly Thr Asp Asn Cys Gly Asp Gly Thr Asp Glu 

3715 . 3720 3725 

Glu Asp Cys Glu Pro Pro Thr Ala His Thr Thr His Cys Lys Asp Lys 

3730 3735 3740 

Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu Arg 
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Ser lie Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser"Ie Cys 
J/ou 3785 



^''SO ^n^A 
cys Asa Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp J Js 
3765 3770 3775 

c 

Gly Asp Glu^AU Arg Cys Val Arg^xSr Glu Lys Ala Ala'^JyJ Cys Ala 
•i. - 3800 3805 

cys Arg Ser Gly Phe His Thr Vai Pro Gly Gin Pro Gly Cys Gin -Asp 

Joi5 3820 
lie Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Cys Asn Asn 

j830 3835 004 A 

Thr Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys t1? 



His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr He 

^ 3865 3870 

^" P'o Gly His Pro His Ser 

•3o/i> 3880 3885 

ser yal Arg He Asp Ala 
3895 3900 
Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp His 

3910 3915 3920 

Thr Gly Thr He^Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro Thr 

o925 3930 3935 

Thr Ser Asn Arg Hi? Arg Arg Gin He Asp Arg Gly Val Thr His Leu 

3940 3945 3950 

Asn He Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp Val 
3955 3960 3965 

r,Vil '^y" Trp Thr Asp Ser Gly Arg Asp Val He Glu Val 

3975 3980 
Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met He 
• 3990 3995 4000 

Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met Tyr 
, « . 4010 4015 

Trp Ser Asp Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met Asp 
„ 4020 4025 4030 

Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro Thr 

4035 4040 4045 

^^^n^n ''^^ Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp Ala 
, 4050 4055 4oeO 

Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro He 
« r 4075 4080 

Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He Asp 

*085 4090 4095 

Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg Val 

4100 4105 4110 

Phe Lys He His . Lys Phe Gly His Ser Pro Leu Val Asn Leu Thr Gly 
4115 4120 4125 ' 

^A^ln ^^"^ Tyr His Gin His Lys Gin 

4135 4140 
Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu Cys 
r . « ^^5° 4155 4160 

Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys Arg 

4165 4170 4175 

Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro Pro 
. <180 4185 4190 

Asp Ala Pro Arg Pro Gly Thr Cys Asn Leu Gin Cys Phe Asn Gly Gly 
4195 . 4200 4205 

^^'^ '^^^ Arg Gin Pro Lys Cys Arg Cys Gin Pro 

4210 4215 4220 
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Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu His Cys 

4235 AOdd 
Arg Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Het Pro Thr Cys 

^245 4250 4255 

Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Gin Gin Val Cys 
*i ^, ^2.^5 4270 

Ala Gly Tyr Cys Ala Asn Asn Ser Thr Cys Thr Val Asn Gin Gly A^n 
4275 4280 4235 

^ 4290 ^^^4295 

Tyr Arg Gin Cys Ser Gly Tyr Cys Glu Asn Phe Gly Thr Cys Gin Met 

•^"5 4310 f3i5 ^ . 432Q 

Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Ala Tyr Phe Glu Gly 

4325 4330 4335 

Ser Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Glu Gly Ala Cys 

. , 4345 4350 

Val Val Asn Lys Gin Ser Gly Asp Val Thr Cys Asn Cys Thr Asp Gly 

4355 4360 4365 

'^^.Y^i '^^'^ Val Gly His Cys Ser Asn Gly 

4370 4375 4390 

Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys Pro 
l^^ „. '*390 4395 4400 

Pro His Met Thr Gly Pro Arg Cys Glu Glu His Val Phe Ser Gin Gin 

4405 4410 4415 

Gin Pro Gly His lie Ala Ser lie Leu lie Pro Leu Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Val Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg Val 
4435 - 4440 4445 

"i* Gin Arg Met Thr Asn Gly Ala Met 
4450 4455 4460 

Asn Val Glu He Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly Glu 
465 4470 4475 4480 

Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp Pro 

4485 4490 4495 

Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr Met 

4500 4505 4510 

Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys Arg 
4515 4520 4525 

^"^9 Gly Pro Glu Asp Glu He Gly Asp Pro Leu Ala 
4530 4535 454O 
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SEQUENCE LISTING 

<110> Aatigenics, Inc. 

<120> ALPHA (2) MACROGLOBULIN RECEPTOR AS A HEAT SHOCK 
PROTEIN RECEPTOR AND USES THEREOF 

<130> 8449-134-228 

<140> 
<141> 

<150> 09/750,972 
<151> 2000-12-28 

<150> 09/668,724 
<151> 2000-09-22 

<150> 60/209,095 
<151> 2000-06/02 

<160> 57 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 14849 
<212> DNA 

<213> Mus musculus 

<400> 1 

cgctgctccc cgccagtgca ctgaggaggc ggaaacgggg gagcccctag tgctccatca 60 

ggcccctacc aaggcacccc catcgggtcc acgcccccca ccccccaccc cgcctcctcc 120 

caattgtgca tttttgcagc cggagtcggc tccgagatgg ggctgtgagc ttcgccctgg 180 

gagggggaga ggagcgagga gtaaagcagg ggtgaagggt tcgaatttgg gggcaggggg 240 

cgcacccgcg tcagcaggcc cttcccaggg ggctcggaac tgtaccattt cacctatgcc 300 

cctggttcgc tttgcttaag gaaggataag atagaagagt cggggagagg aagataaagg 360 

gggacccccc aattgggggg ggcgaggaca agaagtaaca ggaccagagg gtgggggctg 420 

ctgtttgcat cggcccacac catgctgacc ccgccgttgc tgctgctcgt gccgctgctt 480 

tcagctctgg tctccggggc cactatggat gcccctaaaa cttgcagccc taagcagttt 540 

gcctgcagag accaaatcac ctgtatctca aagggctggc ggtgtgacgg tgaaagagat 600 

tgccccgacg gctctgatga agcccctgag atctgtccac agagtaaagc ccagagatgc 660 

ccgccaaatg agcacagttg tctggggact gagctatgtg tccccatgtc tcgtctctgc 720 

aacgggatcc aggactgcat ggatggctca gacgagggtg ctcactgccg agagctccga 780 

gccaactgtt ctcgaatggg ttgtcaacac cattgtgtac ctacacccag tgggcccacg 840 

tgctactgta acagcagctt ccagctcgag gcagatggca agacgtgcaa agattttgac 900 

gagtgttccg tgtatggcac ctgcagccag ctttgcacca acacagatgg ctccttcaca 960 

tgtggctgtg ttgaaggcta cctgctgcaa ccggacaacc gctcctgcaa ggccaagaat 1020 

gagccagtag atcggccgcc agtgctactg attgccaact ctcagaacat cctagctacg 1080 

tacctgagtg gggcccaagt gtctaccatc acacccacca gcacccgaca aaccacggcc 1140 

atggacttca gttatgccaa tgagaccgta tgctgggtgc acgttgggga cagtgctgcc 1200 

cagacacagc tcaagtgtgc ccggatgcct ggcctgaagg gctttgtgga tgagcatacc 1260 

atcaacatct ccctcagcct gcaccacgtg gagcagatgg caatcgactg gctgacggga 1320 

aacttctact ttgtcgacga cattgacgac aggatctttg tctgtaaccg aaacggggac 1380 

acctgtgtca ctctgctgga cctggaactc tacaacccca aaggcatcgc cttggacccc 1440 

gccatgggga aggtgttctt cactgactac gggcagatcc caaaggtgga gcgctgtgac 1500 

atggatggac agaaccgcac caagctggtg gatagcaaga tcgtgtttcc acacggcatc 1560 
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accctggacc tggtcagccg cctcgtctac tgggcggacg cctacctaga ctacatcgag 1620 

gtggtagact acgaagggaa gggtcggcag accatcatcc aaggcatcct gatcgagcac 1680 

ctgtacggcc tgaccgtgtt tgagaactat ctctacgcca ccaactcgga caatgccaac 1740 

acgcagcaga agacgagcgt gatccgagtg aaccggttca acagtactga gtaccaggtc 1800 

gtcacccgtg tggacaaggg tggtgccctg catatctacc accagcgacg ccagccccga 1860 

gtgcggagtc acgcctgtga gaatgaccag tacgggaagc caggtggctg ctccgacatc 1920 

tgcctcctgg ccaacagtca caaggcaagg acctgcaggt gcaggtctgg cttcagcctg 1980 

ggaagtgatg ggaagtcttg taagaaacct gaacatgagc tgttcctcgt gtatggcaag 2040 

ggccgaccag gcatcattag aggcatggac atgggggcca aggtcccaga tgagcacatg 2100 

atccccatcg agaaccttat gaatccacgc gctctggact tccacgccga gaccggcttc 2160 

atctactttg ctgacaccac cagctacctc attggccgcc agaaaattga tggcacggag 2220 

agagagacta tcctgaagga tggcatccac aatgtggagg gcgtagccgt ggactggatg 2280 
ggagacaatc tttactggac tgatgatggc cccaagaaga ccattagtgt ggccaggctg . 2340 

gagaaagccg ctcagacccg gaagactcta attgagggca agatgacaca ccccagggcc 2400 

attgtagtgg atccactcaa tgggtggatg tactggacag actgggagga ggaccccaag 2460 

gacagtcggc gagggcggct cgagagggct tggatggacg gctcacaccg agatatcttt 2520 

gtcacctcca agacagtgct ttggcccaat gggctaagcc tggatatccc agccggacgc 2580 

ctctactggg tggatgcctt ctatgaccga attgagacca tactgctcaa tggcacagac 2640 

cggaagattg tatatgaggg tcctgaactg aatcatgcct tcggcctgtg tcaccatggc 2700 

aactacctct tttggaccga gtaccggagc ggcagcgtct accgcttgga acggggcgtg 2760 

gcaggcgcac cgcccactgt gacccttctg cgcagcgaga gaccgcctat ctttgagatc 2820 

cgaatgtacg acgcgcacga gcagcaagtg ggtaccaaca aatgccgggt aaataacgga 2880 

ggctgcagca gcctgtgcct cgccaccccc gggagccgcc agtgtgcctg tgccgaggac 2 940 

caggtgttgg acacagatgg tgtcacctgc ttggcgaacc catcctacgt gcccccaccc 3000 

cagtgccagc cgggccagtt tgcctgtgcc aacaaccgct gcatccagga gcgctggaag 3060 

tgtgacggag acaacgactg tctggacaac agcgatgagg ccccagcact gtgccatcaa 3120 

cacacctgtc cctcggaccg attcaagtgt gagaacaacc ggtgtatccc caaccgctgg 3180 

ctctgtgatg gggataatga ttgtggcaac agcgaggacg aatccaatgc cacgtgctca 3240 

gcccgcacct gtccacccaa ccagttctcc tgtgccagtg gccgatgcat tcctatctca 3300 

tggacctgtg atctggatga tgactgtggg gaccggtccg atgagtcagc ctcatgcgcc 33 60 

taccccacct gcttccccct gactcaattt acctgcaaca atggcagatg tattaacatc 3420 

aactggcggt gtgacaacga caatgactgt ggggacaaca gcgacgaagc cggctgcagt 3480 

cactcctgct ccagtaccca gttcaagtgc aacagtggca gatgcatccc cgagcactgg 3540 

acgtgtgatg gggacaatga ttgtggggac tacagcgacg agacacacgc caactgtacc 3600 

aaccaggcta caagacctcc tggtggctgc cactcggatg agttccagtg cccgctagat 3660 

ggcctgtgca tccccctgag gtggcgctgc gacggggaca ccgactgcat ggattccagc 3720 

gatgagaaga gctgtgaggg cgtgacccat gtttgtgacc cgaatgtcaa gtttggctgc 3780 

aaggactccg cccggtgcat cagcaaggcg tgggtgtgtg atggcgacag cgactgtgaa 3840 

gataactccg acgaggagaa ctgtgaggcc ctggcctgca ggccaccctc ccatccctgc 3900 

gccaacaaca cctctgtctg cctgcctcct gacaagctgt gcgacggcaa ggatgactgt 3960 

ggagacggct cggatgaggg cgagctctgt gaccagtgtt ctctgaataa tggtggctgt 4020 

agtcacaact gctcagtggc ccctggtgaa ggcatcgtgt gctcttgccc tctgggcatg 4080 

gagctgggct ctgacaacca cacctgccag atccagagct actgtgccaa gcacctcaaa 4140 

tgcagccaga agtgtgacca gaacaagttc agtgtgaagt gctcctgcta cgagggctgg 4200 

gtcttggagc ctgacgggga aacgtgccgc agtctggatc ccttcaaact gttcatcatc 4260 

ttctccaacc gccacgagat caggcgcatt gaccttcaca agggggacta cagcgtccta 4320 

gtgcctggcc tgcgcaacac tattgccctg gacttccacc tcagccagag tgccctctac 4380 

tggaccgacg cggtagagga caagatctac cgtgggaaac tcctggacaa cggagccctg 4440 

accagctttg aggtggtgat tcagtatggc ttggccacac cagagggcct ggctgtagat 4500 

tggattgcag gcaacatcta ctgggtggag agcaacctgg accagatcga agtggccaag 4560 

ctggacggaa ccctccgaac cactctgctg gcgggtgaca ttgagcaccc gagggccatc 4620 

gctctggacc ctcgggatgg gattctgttt tggacagact gggatgccag cctgccacga 4680 

atcgaggctg catccatgag tggagctggc cgccgaacca tccaccggga gacaggctct 4740 

gggggctgcg ccaatgggct caccgtggat tacctggaga agcgcatcct ctggattgat 4800 

gctaggtcag atgccatcta ttcagcccgg tatgacggct ccggccacat ggaggtgctt 4 860 

cggggacacg agttcctgtc acacccattt gccgtgacac tgtacggtgg ggaggtgtac 4920 

tggaccgact ggcgaacaaa tacactggct aaggccaaca agtggactgg ccacaacgtc 4980 

accgtggtac agaggaccaa cacccagccc ttcgacctgc aggtgtatca cccttcccgg 5040 
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cagcccatgg ctccaaaccc atgtgaggcc aatggcggcc ggggcccctg ttcccatctg 5100 

tgcctcatca actacaaccg gaccgtctcc tgggcctgtc cccacctcat gaagctgcac 5160 

aaggacaaca ccacctgcta tgagtttaag aagttcctgc tgtacgcacg tcagatggag 5220 

atccggggcg tggacctgga tgccccgtac tacaattata tcatctcctt cacggtgcct 5280 

gatatcgaca atgtcacggt gctggactat gatgcccgag agcagcgagt ttactggtct 5340 

gatgtgcgga ctcaagccat caaaagggca tttatcaacg gcactggcgt ggagaccgtt 5400 

gtctctgcag acttgcccaa cgcccacggg ctggctgtgg actgggtctc ccgaaatctg 5460 

ttttggacaa gttacgacac caacaagaag cagattaacg tggcccggct ggacggctcc 5520 

ttcaagaatg cggtggtgca gggcctggag cagccccacg gcctggtcgt ccacccgctt 5580 

cgtggcaagc tctactggac tgatggggac aacatcagca tggccaacat ggatgggagc 5640 

aaccacactc tgctcttcag tggccagaag ggccctgtgg ggttggccat tgacttccct 5700 

gagagcaaac tctactggat cagctctggg aaccacacaa tcaaccgttg caatctggat 5760 

gggagcgagc tggaggtcat cgacaccatg cggagccagc tgggcaaggc cactgccctg 5820 

gccatcatgg gggacaagct gtggtgggca gatcaggtgt cagagaagat gggcacgtgc 5880 

aacaaagccg atggctctgg gtccgtggtg ctgcggaaca gtaccacgtt ggttatgcac 5940 

atgaaggtgt atgacgagag catccagcta gagcatgagg gcaccaaccc ctgcagtgtc 6000 

aacaacggag actgttccca gctctgcctg ccaacatcag agacgactcg ctcctgtatg 6060 

tgtacagccg gttacagcct ccggagcgga cagcaggcct gtgagggtgt gggctctttt 6120 

ctcctgtact ctgtacatga gggaattcgg gggattccac tagatcccaa tgacaagtcg 6180 

gatgccctgg tcccagtgtc cggaacttca ctggctgtcg gaatcgactt ccatgccgaa 6240 

aatgacacta tttattgggt ggatatgggc ctaagcacca tcagcagggc caagcgtgac 6300 

cagacatggc gagaggatgt ggtgaccaac ggtattggcc gtgtggaggg catcgccgtg 6360 

gactggatcg caggcaacat atactggacg gaccagggct tcgatgtcat cgaggttgcc 6420 

cggctcaatg gctcttttcg ttatgtggtc atttcccagg gtctggacaa gcctcgggcc 6480 

atcactgtcc acccagagaa ggggtacttg ttctggaccg agtggggtca ttacccacgt 6540 

attgagcggt ctcgccttga tggcacagag agagtggtgt tggttaatgt cagcatcagc 6600 

tggcccaatg gcatctcagt agactatcag ggcggcaagc tctactggtg tgatgctcgg 6660 

atggacaaga tcgagcgcat cgacctggaa acgggcgaga accgggaggt ggtcctgtcc 6720 

agcaataaca tggatatgtt ctccgtgtcc gtgtttgagg acttcatcta ctggagtgac 6780 

agaactcacg ccaatggctc catcaagcgc ggctgcaaag acaatgctac agactccgtg 6840 

cctctgagga caggcattgg tgttcagctt aaagacatca aggtcttcaa cagggacagg 6900 

cagaagggta ccaatgtgtg cgcggtagcc aacggcgggt gccagcagct ctgcttgtat 6960 

cggggtggcg gacagcgagc ctgtgcctgt gcccacggga tgctggcaga agacggggcc 7020 

tcatgccgag agtacgctgg ctacctgctc tactcagagc ggaccatcct caagagcatc 7080 

cacctgtcgg atgagcgtaa cctcaacgca ccggtgcagc cctttgaaga ccccgagcac 7140 

atgaaaaatg tcatcgccct ggcctttgac taccgagcag gcacctcccc ggggacccct 7200 

aaccgcatct tcttcagtga catccacttt gggaacatcc agcagatcaa tgacgatggc 7260 

tcgggcagga ccaccatcgt ggaaaatgtg ggctctgtgg aaggcctggc ctatcaccgt 7320 

ggctgggaca cactgtactg gacaagctac accacatcca ccatcacccg ccacaccgtg 7360 

gaccagactc gcccaggggc cttcgagagg gagacagtca tcaccatgtc cggagacgac 7440 

cacccgagag cctttgtgct ggatgagtgc cagaacctga tgttctggac caattggaac 7500 

gagctccatc caagcatcat gcgggcagcc ctatccggag ccaacgtcct gaccctcatt 7560 

gagaaggaca tccgcacgcc caatgggttg gccatcgacc accgggcgga gaagctgtac 7620 

ttctcggatg ccaccttgga caagatcgag cgctgcgagt acgacggctc ccaccgctat 7680 

gtgatcctaa agtcggagcc cgtccacccc tttgggttgg cggtgtacgg agagcacatt 7740 

ttctggactg actgggtgcg gcgggctgtg cagcgagcca acaagtatgt gggcagcgac 7800 

atgaagctgc ttcgggtgga cattccccag caacccatgg gcatcatcgc cgtggccaat 7860 

gacaccaaca gctgtgaact ctccccctgc cgtatcaaca atggaggctg ccaggatctg 7920 

tgtctgctca cccaccaagg ccacgtcaac tgttcctgtc gagggggccg gatcctccag 7980 

gaggacttca cctgccgggc tgtgaactcc tcttgtcggg cacaagatga gtttgagtgt 8040 

gccaatgggg aatgtatcag cttcagcctc acctgtgatg gcgtctccca ctgcaaggac 8100 

aagtccgatg agaagccctc ctactgcaac tcacgccgct gcaagaagac tttccgccag 8160 

tgtaacaatg gccgctgtgt atccaacatg ctgtggtgca atggggtgga ttactgtggg 8220 

gatggctctg atgagatacc ttgcaacaag actgcctgtg gtgtgggtga gttccgctgc 8280 

cgggatgggt cctgcatcgg gaactccagt cgctgcaacc agtttgtgga ttgtgaggat 8340 

gcctcggatg agatgaattg cagtgccaca gactgcagca gctatttccg cctgggcgtg 8400 

aaaggtgtcc tcttccagcc gtgcgagcgg acatccctgt gctacgcacc tagctgggtg 8460 

tgtgatggcg ccaacgactg tggagactac agcgatgaac gtgactgtcc aggtgtgaag 8520 
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cgccctaggt gcccgctcaa ttactttgcc tgccccagcg ggcgctgtat ccccatgagc 8580 

tggacgtgtg acaaggagga tgactgtgag aacggcgagg atgagaccca ctgcaacaag 8640 

ttctgctcag aggcacagtt cgagtgccag aaccaccggt gtatctccaa gcagtggctg 8700 

tgtgacggta gcgatgattg cggggatggc tccgatgagg cagctcactg tgaaggcaag 8760 

acatgtggcc cctcctcctt ctcctgtccc ggcacccacg tgtgtgtccc tgagcgctgg 8820 

ctctgtgatg gcgacaagga ctgtaccgat ggcgcggatg agagtgtcac tgctggctgc 8880 

ctgtacaaca gcacctgtga tgaccgtgag ttcatgtgcc agaaccgctt gtgtattccc 8940 

aagcatttcg tgtgcgacca tgaccgtgac tgtgctgatg gctctgatga atcccctgag 9000 

tgtgagtacc caacctgcgg gcccaatgaa ttccgctgtg ccaatgggcg ttgtctgagc 9060 

tcccgtcagt gggaatgtga tggggagaat gactgtcacg accacagcga tgaggctccc 9120 

aagaacccac actgcaccag cccagagcac aaatgcaatg cctcatcaca gttcctgtgc 9180 

agcagcgggc gctgcgtggc tgaggcgttg ctctgcaacg gccaggacga ctgtggggac 9240 

ggttcagacg aacgcgggtg ccatgtcaac gagtgtctca gccgcaagct cagtggctgc 9300 

agtcaggact gcgaggacct caagataggc tttaagtgcc gctgtcgccc gggcttccgg 9360 

ctaaaggacg atggcaggac ctgtgccgac ctggatgagt gcagcaccac cttcccctgc 9420 

agccagctct gcatcaacac ccacggaagt tacaagtgtc tgtgtgtgga gggctatgca 9480 

ccccgtggcg gtgaccccca cagctgcaaa gctgtgaccg atgaggagcc atttctcatc 9540 

tttgccaacc ggtactacct gcggaagctc aacctggacg gctccaacta cacactgctt 9600 

aagcagggcc tgaacaatgc ggtcgccttg gcatttgact accgagagca gatgatctac 9660 

tggacgggcg tgaccaccca gggcagcatg attcgcagga tgcacctcaa cggcagcaac 9720 

gtgcaggttc tgcaccggac gggccttagt aacccagatg ggctcgctgt ggactgggtg 9780 

ggtggcaacc tgtactggtg tgacaagggc agagatacca ttgaggtgtc caagcttaac 9840 

ggggcctatc ggacagtgct ggtcagctct ggcctccggg agcccagagc tctggtagtg 9900 

gatgtacaga atgggtacct gtactggaca gactggggtg accactcact gatcggccgg 9960 

attggcatgg atggatctgg ccgcagcatc atcgtggaca ctaagatcac atggcccaat 10020 

ggcctgaccg tggactacgt cacggaacgc atctactggg ctgacgcccg tgaggactac 10080 

atcgagttcg ccagcctgga tggctccaac cgtcacgttg tgctgagcca agacatccca 10140 

cacatctttg cgctgaccct atttgaagac tacgtctact ggacagactg ggaaacgaag 10200 

tccatcaacc gggcccacaa gaccacgggt gccaacaaaa cactcctcat cagcaccctg 10260 

caccggccca tggacttaca tgtattccac gccctgcgcc agccagatgt gcccaatcac 10320 

ccctgcaaag tcaacaatgg tggctgcagc aacctgtgcc tgctgtcccc tgggggtggt 10380 

cacaagtgcg cctgccccac caacttctat ctgggtggcg atggccgtac ctgtgtgtcc 10440 

aactgcacag caagccagtt tgtgtgcaaa aatgacaagt gcatcccctt ctggtggaag 10500 

tgtgacacgg aggacgactg tggggatcac tcagacgagc ctccagactg tcccgagttc 10560 

aagtgccgcc caggccagtt ccagtgctcc accggcatct gcaccaaccc tgccttcatc 10620 

tgtgatgggg acaatgactg ccaagacaat agtgacgagg ccaattgcga cattcacgtc 10680 

tgcttgccca gccaattcaa gtgcaccaac accaaccgct gcattcctgg catcttccgt 10740 

tgcaatgggc aggacaactg cggggacggc gaggatgagc gggattgccc tgaggtgacc 10800 

tgcgccccca accagttcca gtgctccatc accaagcgct gcatccctcg cgtctgggtc 10860 

tgtgacaggg ataatcactg tgtggacggc agtgatgagc ctgccaactg tacccaaatg 10920 

acctgtggag tggatgagtt ccgctgcaag gattctggcc gctgcatccc cgcgcgctgg 10980 

aagtgtgacg gagaagatga ctgtggggat ggttcagatg agcccaagga agagtgtgat 11040 

gagcgcacct gtgagccata ccagttccgc tgcaaaaaca accgctgtgt cccaggccgt 11100 

tggcaatgtg actacgacaa cgactgcgga gataactcgg acgaggagag ctgcacacct 11160 

cggccctgct ctgagagtga gtttttctgt gccaatggcc gctgcatcgc tgggcgctgg 11220 

aagtgtgatg gggaccatga ctgtgccgac ggctcagacg agaaagactg caccccccgc 11280 

tgtgatatgg accagttcca gtgcaagagt ggccactgca tccccctgcg ctggccgtgt 11340 

gacgcggatg ctgactgtat ggacggcagt gacgaggaag cctgtggcac tggggtgagg 11400 

acctgcccat tggatgagtt tcaatgtaac aacaccttgt gcaagccgct ggcctggaag 11460 

tgtgatggag aggacgactg tggggacaac tcagatgaga accccgagga atgcgcccgg 11520 

ttcatctgcc ctcccaaccg gcctttccgc tgcaagaatg accgagtctg cctgtggatt 11580 

gggcgccagt gtgatggcgt ggacaactgt ggagatggga ctgacgagga ggactgtgag 11640 

ccccccacgg cccagaaccc ccactgcaaa gacaagaagg agttcctgtg ccgaaaccag 11700 

cgctgtctat catcctccct gcgctgtaac atgttcgatg actgcggcga tggctccgat 11760 

gaagaagatt gcagcatcga ccccaagctg accagctgtg ccaccaatgc cagcatgtgt 11820 

ggggacgaag ctcgttgtgt gcgcactgag aaagctgcct actgtgcctg ccgctcgggc 11880- 

ttccatactg tgccgggcca gcccggatgc caggacatca acgagtgcct gcgctttggt 11940 

acctgctctc agctctggaa caaacccaag ggaggccacc tctgcagctg tgcccgcaac 12000 
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ttcatgaaga cacacaacac ctgcaaagct gaaggctccg agtaccaggt gctatacatc 12060 

gcggatgaca acgagatccg cagcttgttc ccgggccacc cccactcagc ctacgagcag 12120 

acattccagg gcgatgagag tgtccgcata gatgccatgg atgtccatgt caaggccggc 12180 

cgtgtctact ggactaactg gcacacgggc acaatctcct acaggagcct gccccctgcc 12240 

gcccctccta ccacttccaa ccgccaccgg aggcagatcg accggggtgt cacccacctc 12300 

aatatttcag ggctgaagat gccgaggggt atcgctatcg actgggtggc cgggaatgtg 12360 

tactggaccg attccggccg agacgtgatt gaggtggcgc aaatgaaggg cgagaaccgc 12420 
aagacgctca tctcgggcat gattgatgag ccccatgcca tcgtggtgga ccctctgagg ' 12480 

ggcaccatgt actggtcaga ctgggggaac caccccaaga ttgaaacagc agcgatggat 12540 

ggcacccttc gggagactct cgtgcaagac aacattcagt ggcctacagg gctggctgtg 12600 

gactatcaca atgaacggct ctactgggca gatgccaagc tttcggtcat cggcagcatc 12660 

cggctcaacg gcactgaccc cattgtggct gctgacagca aacgaggcct aagtcacccc 12720 

ttcagcatcg atgtgtttga agactacatc tacggagtca cttacatcaa taatcgtgtc 12780 

ttcaagatcc acaagtttgg acacagcccc ttgtacaacc taactggggg cctgagccat 12840 

gcctctgatg tagtccttta ccatcaacac aagcagcctg aagtgaccaa cccctgtgac -12900 

cgcaagaaat gcgaatggct gtgtctgctg agccccagcg ggcctgtctg cacctgtccc 12960 

aatggaaaga ggctggataa tggcacctgt gtgcctgtgc cctctccaac accccctcca 13020 

gatgccccta ggcctggaac ctgcactctg cagtgcttca atggtggtag ttgtttcctc 13080 

aacgctcgga ggcagcccaa gtgccgttgc cagccccgtt acacaggcga taagtgtgag 13140 

ctggatcagt gctgggaata ctgtcacaac ggaggcacct gtgcggcttc cccatctggc 13200 

atgcccacgt gccgctgtcc cactggcttc acgggcccca aatgcaccgc acaggtgtgt 13260 

gcaggctact gctctaacaa cagcacctgc accgtcaacc agggcaacca gccccagtgc 13320 

cgatgtctac ctggcttcct gggcgaccgt tgccagtacc ggcagtgctc tggcttctgt 13380 

gagaactttg gcacctgtca gatggctgct gatggctccc gacaatgtcg ctgcaccgtc 13440 

tactttgagg gaccaaggtg tgaggtgaac aagtgtagtc gctgtctcca aggcgcctgt 13500 

gtggtcaata agcagaccgg agatgtcaca tgcaactgca ctgatggccg ggtagccccc 13560 

agttgtctca cctgcatcga tcactgtagc aatggtggct cctgcaccat gaacagcaag 13620 

atgatgcctg agtgccagtg cccgccccat atgacaggac cccggtgcca ggagcaggtt 13680 

gttagtcagc aacagcctgg gcatatggcc tccatcctga tccctctgct gctgcttctc 13740 

ctgctgcttc tggtggctgg cgtggtgttc tggtataagc ggcgagtccg aggggctaag 13800 

ggcttccagc accagcggat gaccaatggg gccatgaatg tggaaattgg aaaccctacc 13860 

tacaagatgt atgaaggtgg agagcccgat gatgtcgggg gcctactgga tgctgatttt 13920 

gcccttgacc ctgacaagcc taccaacttc accaacccag tgtatgccac gctctacatg 13980 

gggggccacg gcagccgcca ttccctggcc agcacggacg agaagcgaga actgctgggc 14040 

cggggacctg aagacgagat aggagatccc ttggcatagg gccctgcccc gacggatgtc 14100 

cccagaaagc cccctgccac atgagtcttt caatgaaccc cctccccagc cggcccttct 14160 

ccggccctgc cgggtgtaca aatgtaaaaa tgaaggaatt actttttata tgtgagcgag 14220 

caagcgagca agcacagtat tatctctttg catttccttc ctgcctgctc ctcagtatcc 14280 

cccccatgct gccttgaggg ggcggggagg gctttgtggc tcaaaggtat gaaggagtcc 14340 

acatgttccc taccgagcat acccctggaa gcctggcggc acggcctccc caccacgcct 14400 

gtgcaagaca ctcaacgggg ctccgtgtcc cagctttcct ttccttggct ctctggggtt 14460 

agttcagggg aggtggagtc ctctgctgac cctgtctgga agatttggct ctagctgagg 14520 

aaggagtctt ttagttgagg gaagtcaccc caaaccccag ctcccacttt caggggcacc 14580 

tctcagatgg ccatgctcag tatcccttcc agacaggccc tcccctctct agcgccccct 14640 

ctgtggctcc tagggctgaa cacattcttt ggtaactgtc ccccaagcct cccatccccc 14700 

tgagggccag gaagagtcgg ggcacaccaa ggaagggcaa gcgggcagcc ccattttggg 14760 

gacgtgaacg ttttaataat ttttgctgaa ttcctttaca actaaataac acagatattg 14820 

ttataaataa aattgtaaaa aaaaaaaaa 14849 



<210> 2 
<211> 4545 
<212> PRT 

<213> Mus mus cuius 
<400> 2 

Met Leu Thr Pro Pro Leu Leu Leu Leu'Val Pro Leu Leu Ser Ala Leu 
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1 








5 






Val 


Ser 


Gly 


Ala 
20 


Thr 


Met 


Asp Ala Pro 
25 


Phe 


Ala 


Cys 

35 


Arg 


Asp 


Gin 


He Thr Cys 
40 


Asp 


Gly 
50 


Glu 


Arg 


Asp 


Cys 


Pro Asp Gly 
55 


Cys 


Pro 


Gin 


Ser 


Lys 


Ala 


Gin Arg Cys 


65 










70 




Leu 


Gly 


Thr 


Glu 


Leu 
85 


Cys 


Val Pro Met 


Gin 


Asp 


Cys 


Met 
100 


Asp 


Gly 


Ser Asp Glu 
105 


Arg 


Ala 


Asn 
115 


Cys 


Ser 


Arg 


Met Gly Cys 
120 


Pro 


Ser 
130 


Gly 


Pro 


Thr 


Cys 


Tyr Cys Asn 
135 


Asp 


Gly 


Lys 


Thr 


Cys 


Lys 


Asp Phe Asp 


145 










150 




Cys 


Ser 


Gin 


Leu 


Cys 
165 


Thr 


Asn Thr Asp 


Val 


Glu 


Gly 


Tyr 
180 


Leu 


Leu 


Gin Pro Asp 
185 


Asn 


Glu 


Pro 

195 


Val 


Asp 


Arg 


Pro Pro Val 
200 


Asn 


He 
210 


Leu 


Ala 


Thr 


Tyr 


Leu Ser Gly 
215 


Pro 


Thr 


Ser 


Thr 


Arg 


Gin 


Thr Thr Ala 


225 










230 




Glu 


Thr 


Val 


Cys 


Trp 
245 


Val 


His Val Gly 


Leu 


Lys 


Cys 


Ala 
260 


Arg 


Met 


Pro Gly Leu 
265 


Thr 


He 


Asn 
275 


He 


Ser 


Leu 


Ser Leu His 
280 


Asp 


Trp 
290 


Leu 


Thr 


Gly 


Asn 


Phe Tyr Phe 
295 


He 


Phe 


Val 


Cys 


Asn 


Arg 


Asn Gly Asp 


305 










310 




Leu 


Glu 


Leu 


Tyr 


Asn 

325 


Pro 


Lys Gly He 


Lys 


Val 


Phe 


Phe 
340 


Thr 


Asp 


Tyr Gly Gin 
345 


Asp 


Met 


Asp 
355 


Gly 


Gin 


Asn 


Arg Thr Lys 
360 


Phe 


Pro 
370 


His 


Gly 


He 


Thr 


Leu Asp Leu 

375 


Ala 


Asp 


Ala 


Tyr 


Leu 


Asp 


Tyr He Glu 


385 










390 




Gly 


Arg 


Gin 


Thr 


He 
405 


He 


Gin Gly He 


Leu 


Thr 


Val 


Phe 
420 


Glu 


Asn 


Tyr Leu Tyr 
425 


Asn 


Thr 


Gin 
435 


Gin 


Lys 


Thr 


Ser Val He 
440 


Thr 


Glu 
450 


Tyr 


Gin 


Val 


Val 


Thr Arg Val 
455 
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10 




15 




Lys Thr Cys Ser 


Pro 


Lys 


Gin 




30 






He Ser Lys Gly 


Trp 


Arg 


Cys 


45 








Ser Asp Glu Ala 


Pro 


Glu 


He 


60 








Pro Pro Asn Glu 


His 


Ser 


Cys 


75 






80 


Ser Arg Leu Cys 


Asn 


Gly 


He 


90 




95 




Gly Ala His Cys 


Arg 


Glu 


Leu 




110 






Gin His His Cys 


Val 


Pro 


Thr 


125 








Ser Ser Phe Gin 


Leu 


Glu 


Ala 


140 








Glu Cys Ser Val 


Tyr 


Gly 


Thr 


155 






160 


Gly Ser Phe Thr 


Cys 


Gly 


Cys 


170 




175 




Asn Arg Ser Cys 


Lys 


Ala 


Lys 




190 






Leu Leu He Ala 


Asn 


Ser 


Gin 


205 








Ala Gin Val Ser 


Thr 


He 


Thr 


220 








Met Asp Phe Ser 


Tyr 


Ala 


Asn 


235 






240 


Asp Ser Ala Ala 


Gin 


Thr 


Gin 


250 




255 




Lys Gly Phe Val 


Asp 


Glu 


His 




270 






His Val Glu Gin 


Met 


Ala 


He 


285 








Val Asp Asp He 


Asp 


Asp 


Arg 


300 








Thr Cys Val Thr 


Leu 


Leu 


Asp 


315 






320 


Ala Leu Asp Pro 


Ala 


Met 


Gly 


330 




335 




He Pro Lys Val 


Glu 


Arg 


Cys 




350 






Leu Val Asp Ser 


Lys 


He 


Val 


365 








Val Ser Arg Leu 


Val 


Tyr 


Trp 


380 








Val Val Asp Tyr 


Glu 


Gly 


Lys 
* 


395 






400 


Leu He Glu His 


Leu 


Tyr 


Gly 


410 




415 




Ala Thr Asn Ser 


Asp 


Asn 


Ala 




430 






Arg Val Asn Arg 


Phe 


Asn 


Ser 


445 








Asp Lys Gly Gly 


Ala 


Leu 


His 



460 



6 
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lie 


Tyr 


His 


Gin 


Arg 


Arg 


Gin 


Pro 


Arg 


Val 


Arg 


Ser 


His 


Ala 


Cys 


Glu 


465 










470 










475 










480 


Asn 


Asp 


Gin 


Tyr 


Gly 


Lys 


Pro 


Gly 


Gly 


Cys 


Ser 


Asp 


He 


Cys 


Leu 


Leu 










485 










490 










495 




Ala 


Asn 


Ser 


His 


Lys 


Ala 


Arg 


Thr 


Cys 


Arg 


Cys 


Arg 


Ser 


Gly 


Phe 


Ser 








500 










505 










510 






Leu 


Gly 


Ser 


Asp 


Gly 


Lys 


Ser 


Cys 


Lys 


Lys 


Pro 


Glu 


His 


Glu 


Leu 


Phe 






515 










520 










525 








Leu 


Val 


Tyr 


Gly 


Lys 


Gly 


Arg 


Pro 


Gly 


He 


He 


Arg 


Gly 


Met 


Asp 


Met 




530 










535 










540 










Gly 


Ala 


Lys 


Val 


Pro 


Asp 


Glu 


His 


Met 


He 


Pro 


He 


Glu 


Asn 


Leu 


Met 


545 










550 










555 










560 


Asn 


Pro 


Arg 


Ala 


Leu 


Asp 


Phe 


His 


Ala 


Glu 


Thr 


Gly 


Phe 


He 


Tyr 


Phe 










565 










570 










575 




Ala 


Asp 


Thr 


Thr 


Ser 


Tyr 


Leu 


He 


Gly 


Arg 


Gin 


Lys 


He 


Asp 


Gly 


Thr 








580 










585 










590 






Qlu 


Arg 


Qlu 


Thr 


He 


Leu 


Lys 


Asp 


Gly 


He 


His 


Asn 


Val 


Glu 


Gly 


val 






595 










600 










605 








Ala 


Val 


Asp 


Trp 


Met 


Gly 


Asp 


Asn 


Leu 


Tyr 


Trp 


Thr 


Asp 


Asp 


Gly 


Pro 
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Asn Ala Thr Cys Ser Ala Arg Thr Cys Pro Pro Asn Gin Phe Ser Cys 

930 935 940 

Ala Ser Gly Arg Cys lie Pro lie Ser Trp Thr Cys Asp Leu Asp Asp 
945 950 955 960 

Asp Cys Gly Asp Arg Ser Asp Qlu Ser Ala Ser Cys Ala Tyr Pro Thr 

965 970 975 

Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys lie Asn 

980 985 990 

lie Asn Trp Arg Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser Asp 

995 1000 .1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 . 1020 

Ser Gly Arg Cys He Pro Glu His Txp Thr Cys Asp Gly Asp Asn Asp 
1025 1030 1035 104( 

Cys Qly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys He Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 1100 

Cys Asp Pro Asn Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys He 
1105 1110 1115 112( 

Ser Lys Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 

1125 1130 1135 

Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

1140 1145 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 1180 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
1185 1190 1195 1201 

Pro Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe He He Phe Ser Asn Arg His Glu He 
1265 1270 1275 1281 

Arg Arg He Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr He Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu 

1315 1320 1325 

Asp Asn Gly Ala Leu Thr Ser Phe Glu Val Val He Gin Tyr Gly Leu 

1330 1335 1340 

Ala Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr 
1345 1350 1355 1361 

Trp Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly 

1365 1370 1375 

Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala 



1380 



1385 



1390 
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He Ala Leu Asp Pro Arg Asp Gly He Leu Phe Trp Thr Asp Trp Asp 

1395 1400 1405 

Ala Ser Leu Pro Arg lie Glu Ala Ala Ser Met Ser Oly Ala Gly Arg 

1410 1415 1420 

Arg Thr He His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
1425 1430 1435 144( 

Thr Val Asp Tyr Leu Glu Lys Arg He Leu Trp He Asp Ala Arg Ser 

1445 1450 1455 

Asp Ala He Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

Gly Gly Glu Val Tyr Tip Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
1505 1510 1515 152( 

Thr Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met 

1525 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 1545 1550 

Leu Cys Leu He Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu He Arg Gly Val Asp Leu Asp 
1585 1590 1595 160( 

Ala Pro lyr Tyr Asn Tyr He He Ser Phe Thr Val Pro Asp He Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu 

1650 1655 1660 

Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
1665 1670 1675 168( 

Asn Lys Lys Oln He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 1695 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala 

1715 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp He 
1745 1750 1755 176( 

Ser Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 1775 

Leu Glu Val He Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala He Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser 
1825 1830 1835 1841 

He Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 



1845 



1850 



1855 
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Asp Cya Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys 

1860 1865 1870 

Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 

1875 1880 1885 

Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly He Arg Gly 

1890 1895 1900 

He Pro Leu Asp Pro Asn Aap Lys Ser Asp Ala Leu Val Pro Val Ser 
1905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr 

1925 1930 1935 

He Tyr Trp Val Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg 

1940 1945 1950 

Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val 

1955 1960 1965 

Glu Gly He Ala Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp 

1970 1975 1980 

Gin Gly Phe Asp Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
1985 1990 1995 2000 

Tyr Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val 

2005 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Pro 

2020 2025 2030 

Arg He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Gly 

2050 2055 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
2065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
2145 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 2185 2190 

Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

He Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu 
2225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He 

2245 .2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn He Gin Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
2305 2310 2315 2320 
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Phe Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg 

2325 2330 2335 

Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 

2340 2345 2350 

Asn Glu Leu His Pro Ser He Met Arg Ala Ala Leu Ser Gly Ala Asn 

2355 2360 2365 

Val Leu Thr Leu He Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala 

2370 2375 2380 

He Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp 
2385 2390 2395 2400 

Lys He Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu 

2405 2410 2415 

Lys Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His 

2420 2425 2430 

He Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys 

2435 2440 2445 

Tyr Val Gly Ser Asp Met Lys Leu Leu Arg Val Asp He Pro Gin Qln 

2450 2455 2460 

Pro Met Gly He He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
2465 2470 2475 2480 

Ser Pro Cys Arg He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 

2485 2490 2495 

Thr His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu 

2500 2505 2510 

Gin Glu Asp Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2515 2520 2525 

Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys He Ser Phe Ser Leu Thr 

2530 2535 2540 

Cys Asp Gly Val Ser His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
2545 2550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 

2565 2570 2575 

Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Gly Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val 

2595 2600 2605 

Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys He Gly Asn Ser Ser Arg 

2610 2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 
2625 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val 

2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 

2660 2665 2670 

Val Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp 

2675 2680 2685 

Cys Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys 

2690 2695 2700 

Pro Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp 
2705 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp 

2740 2745 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 

2755 2760 2765 

His Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly 
2770 2775 2780 
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Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Qly Asp Lys Asp 
2785 2790 2795 2800 

Cys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 

2805 2810 2815 

Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Leu Cys lie 

2820 2825 2830 

Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asn Glu Phe 

2850 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
2865 2870 2875 2880 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
2945 2950 2955 2960 

Lys He Gly Phe Lys Cys Arg Cys Arg Pro Qly Phe Arg Leu Lys Asp 

2965 2970 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2960 2985 2990 

Cys Ser Gin Leu Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 

2995 3000 3005 

Val Glu Gly Tyr Ala Pro Arg Qly Gly Asp Pro His Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu He Phe Ala Asn Arg Tyr Tyr Leu 
3025 3030 3035 3040 

Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp Tyr Arg Glu Gin Met He 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met He Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
3105 3110 3115 3120 

Asp Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu He Gly Arg He Gly Met Asp Gly Ser Gly Arg Ser He He 

3170 3175 3180 

Val Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Val Asp Tyr Val 
3185 3190 3195 3200 

Thr Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He 

3220 3225 3230 

Pro His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 
3235 3240 3245 
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Asp Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His 
3265 3270 3275 3280 

Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cye Lys 

3285 3290 3295 

Val Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly 

3300 3305 3310 

Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly • 

3315 3320 3325 

Arg Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn 

3330 3335 3340 

Asp Lys Cys He Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys 
3345 3350 3355 3360 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 3370 3375 

Pro Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe 

3380 3385 3390 

He Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn 

3395 3400 3405 

Cys Asp He His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr 

3410 3415 3420 

Asn Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys 
3425 3430 3435 3440 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

3445 3450 3455 

Asn Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp 

3460 3465 3470 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 

3475 3480 3485 

Asn Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp 

3490 3495 3500 

Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
3505 3510 3515 3520 

Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

3525 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 

3555 3560 3565 

Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Phe Cys Ala 

3570 3575 3580 

Asn Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp 
3585 3590 3595 3600 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 

3620 3625 3630 

Cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 

3635 3640 3645 

Gly Thr Gly Val Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
3665 3670 3675 3680 

Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe He Cys 

3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 
3700 3705 3710 
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lie Gly Arg Gin Cys Asp Oly Val Asp Asn Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Qlu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 

3730 3735 3740 

Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
3745 3750 3755 3760 

Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Oly Ser Asp Glu Glu Asp 

3765 3770 3775 

Cys Ser lie Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

3780 3785 3790 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 

3795 3800 3805 

Ala Cys Arg Ser Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin 

3810 3815 3820 

Asp lie Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
3825 3830 3835 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3850 3855 

Thr His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr 

3860 3865 3870 

lie Ala Asp Asp Asn Glu lie Arg Ser Leu Phe Pro Gly His Pro His 

3875 3880 3885 

Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp Glu Ser Val Arg lie Asp 

3890 3895 3900 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp 
3905 3910 3915 3920 

His Thr Gly Thr lie Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

3925 3930 3935 

Thr thr Ser Asn Arg His Arg Arg Gin lie Asp Arg Gly Val Thr His 

3940 3945 3950 

Leu Asn He Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp 

3955 3960 3965 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val lie Glu 

3970 3975 3980 

Val Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met 
3985 3990 3995 4000 

He Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met 

4005 4010 4015 

Tyr Trp Ser Asp Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro 

4035 4040 4045 

Thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro 
4065 4070 4075 4080 

He Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg 

4100 4105 4110 

Val Phe Lys He His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val T2ir Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 
4145 4150 4155 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 
4165 4170 4175 
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Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 

4195 4200 4205 

Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin 

4210 4215 4220 

Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu Tyr 
4225 4230 4235 4240 

Cys His Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr 

4245 4250 4255 

Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

4260 4265 4270 

Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 

4275 4280 4285 * 

Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys 

4290 4295 4300 

Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu Asn Phe Gly Thr Cys Gin 
4305 4310 4315 4320 ' 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 

4325 4330 4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 

4340 4345 4350 

Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 

4355 4360 4365 

Gly Arg Val Ala Pro Ser Cys Leu Thr Cys He Asp His Cys Ser Asn 

4370 4375 4380 

Gly Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys 
4385 4390 4395 4400 

Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4405 4410 4415 

Gin Gin Pro Gly His Met Ala Ser He Leu He Pro Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 

4435 4440 4445 

Val Arg Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala 

4450 4455 4460 

Met Asn Val Glu He Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 
4465 4470 4475 4480 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 4495 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 

4515 4520 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu He Gly Asp Pro Leu 
4530 4535 4540 

Ala 
4545 

<210> 3 

<211> 4577 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gctacaatcc atctggtctc ctccagctcc ttctttctgc aacatgggga agaacaaact 60 

ccttcatcca agtctggttc ttctcctctt ggtcctcctg cccacagacg cctcagtctc 120 

tggaaaaccg cagtatatgg ttctggtccc ctccctgctc cacactgaga ccactgagaa 180 
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gggctgtgtc cttctgagct acctgaatga gacagtgact gtaagtgctt ccttggagtc 240 
tgtcagggga aacaggagcc tcttcactga cctggaggcg gagaatgacg tactccactg 300 
tgtcgccttc gctgtcccaa agtcttcatc caatgaggag gtaatgttcc tcactgtcca 360 
agtgaaagga ccaacccaag aatttaagaa gcggaccaca gtgatggtta agaacgagga 420 
cagtctggtc tttgtccaga cagacaaatc aatctacaaa ccagggcaga cagtgaaatt 480 
tcgtgttgtc tccatggatg aaaactttca ccccctgaat gagttgattc cactagtata 540 
cattcaggat cccaaaggaa atcgcatcgc acaatggcag agtttccagt tagagggtgg 600 
cctcaagcaa ttttcttttc ccctctcatc agagcccttc cagggctcct acaaggtggt 660 
ggtacagaag aaatcaggtg gaaggacaga gcaccctttc accgtggagg aatttgttct 720 
tcccaagttt gaagtacaag taacagtgcc aaagataatc accatcttgg aagaagagat 780 
gaatgtatca gtgtgtggcc tatacacata tgggaagcct gtccctggac atgtgactgt 840 
gagcatttgc agaaagtata gtgacgcttc cgactgccac ggtgaagatt cacaggcttt 900 
ctgtgagaaa ttcagtggac agctaaacag ccatggctgc ttc tat cage aagtaaaaac 960 

caaggtcttc cagctgaaga ggaaggagta tgaaatgaaa cttcacactg aggcccagat 1020 

ccaagaagaa ggaacagtgg tggaattgac tggaaggcag tccagtgaaa tcacaagaac 1080 

cataaccaaa ctctcatttg tgaaagtgga ctcacacttt cgacagggaa ttcccttctt 1140 

tgggcaggtg cgcctagtag atgggaaagg cgtccctata ccaaataaag tcatattcat 12 00 

cagaggaaat gaagcaaact attactccaa tgctaccacg gatgagcatg gccttgtaca 1260 

gttctctatc aacaccacca acgttatggg tacctctctt actgttaggg tcaattacaa 1320 

ggatcgtagt ccctgttacg gctaccagtg ggtgtcagaa gaacacgaag aggcacatca 1380 

cactgcttat cttgtgttct ccccaagcaa gagctttgtc caccttgagc ccatgtctca 1440 

tgaactaccc tgtggccata ctcagacagt ccaggcacat tatattctga atggaggcac 1500 

cctgctgggg ctgaagaagc tctcctttta ttatctgata atggcaaagg gaggcattgt 1560 

ccgaactggg actcatggac tgcttgtgaa gcaggaagac atgaagggcc atttttccat 1620 

ctcaatccct gtgaagtcag acattgctcc tgtcgctcgg ttgctcatct atgctgtttt 1680 

acctaccggg gacgtgattg gggattctgc aaaatatgat gttgaaaatt gtctggccaa 1740 

caaggtggat ttgagcttca gcccatcaca aagtctccca gcctcacacg cccacctgcg 1800 

agtcacagcg gctcctcagt ccgtctgcgc cctccgtgct gtggaccaaa gcgtgctgct 1860 

catgaagcct gatgctgagc tctcggcgtc ctcggtttac aacctgctac cagaaaagga 1920 

cctcactggc ttccctgggc ctttgaatga ccaggacgat gaagactgca tcaatcgtca 1980 

taatgtctat attaatggaa tcacatatac tccagtatca agtacaaatg aaaaggatat 2040 

gtacagcttc ctagaggaca tgggcttaaa ggcattcacc aactcaaaga ttcgtaaacc 2100 

caaaatgtgt ccacagcttc aacagtatga aatgcatgga cctgaaggtc tacgtgtagg 2160 

tttttatgag tcagatgtaa tgggaagagg ccatgcacgc ctggtgcatg ttgaagagcc 2220 

tcacacggag accgtacgaa agtacttccc tgagacatgg atctgggatt tggtggtggt 22 80 

aaactcagca ggggtggctg aggtaggagt aacagtccct gacaccatca ccgagtggaa 2340 

ggcaggggcc ttctgcctgt ctgaagatgc tggacttggt atctcttcca ctgcctctct 2400 

ccgagccttc cagcccttct ttgtggagct tacaatgcct tactctgtga ttcgtggaga 2460 

ggccttcaca ctcaaggcca cggtcctaaa ctaccttccc aaatgcatcc gggtcagtgt 2520 

gcagctggaa gcctctcccg ccttccttgc tgtcccagtg gagaaggaac aagcgcctca 2580 

ctgcatctgt gcaaacgggc ggcaaactgt gtcctgggca gtaaccccaa agtcattagg 2640 

aaatgtgaat ttcactgtga gcgcagaggc actagagtct caagagctgt gtgggactga 2700 

ggtgccttca gttcctgaac acggaaggaa agacacagtc atcaagcctc tgttggttga 2760 

acctgaagga ctagagaagg aaacaacatt caactcccta ctttgtccat caggtggtga 2820 

ggtttctgaa gaattatccc tgaaactgcc accaaatgtg gtagaagaat ctgcccgagc 2880 

ttctgtctca gttttgggag acatattagg ctctgccatg caaaacacac aaaatcttct 2940 

ccagatgccc tatggctgtg gagagcagaa tatggtcctc tttgctccta acatctatgt 3000 

actggattat ctaaatgaaa cacagcagct tactccagag gtcaagtcca aggccattgg 3060 

ctatctcaac actggttacc agagacagtt gaactacaaa cactatgatg gctcctacag 3120 

cacctttggg gagcgatatg gcaggaacca gggcaacacc tggctcacag cctttgttct 3180 

gaagactttt gcccaagctc gagcctacat cttcatcgat gaagcacaca ttacccaagc 3240 

cctcatatgg ctctcccaga ggcagaagga caatggctgt ttcaggagct ctgggtcact 3300 

gctcaacaat gccataaagg gaggagtaga agatgaagtg accctctccg cctatatcac 3360 

catcgccctt ctggagattc ctctcacagt cactcaccct gttgtccgca atgccctgtt 3420 

ttgcctggag tcagcctgga agacagcaca agaaggggac catggcagcc atgtatatac 3480 

caaagcactg ctggcctatg cttttgccct ggcaggtaac caggacaaga ggaaggaagt 3540 

actcaagtca cttaatgagg aagctgtgaa gaaagacaac tctgtccatt gggagcgccc 3600 

tcagaaaccc aaggcaccag tggggcattt ttacgaaccc caggctccct ctgctgaggt 3660 
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ggagatgaca tcctatgtgc tcctcgctta tctcacggcc cagccagccc caacctcgga 3720 

ggacctgacc tctgcaacca acatcgtgaa gtggatcacg aagcagcaga atgcccaggg 3780 

cggtttctcc tccacccagg acacagtggt ggctctccat gctctgtcca aatatggagc 3840 

cgccacattt accaggactg ggaaggctgc acaggtgact atccagtctt cagggacatt 3900 

ttccagcaaa ttccaagtgg acaacaacaa tcgcctgtta ctgcagcagg tctcattgcc 3960 

agagctgcct ggggaataca gcatgaaagt gacaggagaa ggatgtgtct acctccagac 4020 

ctccttgaaa tacaatattc tcccagaaaa ggaagagttc ccctttgctt taggagtgca 4080 

gactctgcct caaacttgtg atgaacccaa agcccacacc agcttccaaa tctccctaag 4140 

tgtcagttac acagggagcc gctctgcctc caacatggcg atcgttgatg tgaagatggt 4200 

ctctggcttc attcccctga agccaacagt gaaaatgctt gaaagatcta accatgtgag 4260 

ccggacagaa gtcagcagca accatgtctt gatttacctt gataaggtgt caaatcagac 4320 

actgagcttg ttcttcacgg ttctgcaaga tgtcccagta agagatctca aaccagccat 4380 

agtgaaagtc tatgattact acgagacgga tgagtttgca atcgctgagt acaatgctcc 4440 

ttgcagcaaa gatcttggaa atgcttgaag accacaaggc tgaaaagtgc tttgctggag 4500 

tcctgttctc tgagctccac agaagacacg tgtttttgta tctttaaaga cttgatgaat 4560 

aaacactttt tctggtc 4577 

<210> 4 

<211> 4422 

<212> DNA 

<213> Homo sapiens 

<400> 4 

atggggaaga acaaactcct tcatccaagt ctggttcttc tcctcttggt cctcctgccc 60 

acagacgcct cagtctctgg aaaaccgcag tatatggttc tggtcccctc cctgctccac 120 

actgagacca ctgagaaggg ctgtgtcctt ctgagctacc tgaatgagac agtgactgta 180 

agtgcttcct tggagtctgt caggggaaac aggagcctct tcactgacct ggaggcggag 240 

aatgacgtac tccactgtgt cgccttcgct gtcccaaagt cttcatccaa tgaggaggta 300 

atgttcctca ctgtccaagt gaaaggacca acccaagaat ttaagaagcg gaccacagtg 360 

atggttaaga acgaggacag tctggtcttt gtccagacag acaaatcaat ctacaaacca 420 

gggcagacag tgaaatttcg tgttgtctcc atggatgaaa actttcaccc cctgaatgag 480 

ttgattccac tagtatacat tcaggatccc aaaggaaatc gcatcgcaca atggcagagt 540 

ttccagttag agggtggcct caagcaattt tcttttcccc tctcatcaga gcccttccag 600 

ggctcctaca aggtggtggt acagaagaaa tcaggtggaa ggacagagca ccctttcacc 660 

gtggaggaat ttgttcttcc caagtttgaa gtacaagtaa cagtgccaaa gataatcacc 720 

atcttggaag aagagatgaa tgtatcagtg tgtggcctat acacatatgg gaagcctgtc 780 

cctggacatg tgactgtgag catttgcaga aagtatagtg acgcttccga ctgccacggt 840 

gaagattcac aggctttctg tgagaaattc agtggacagc taaacagcca tggctgcttc 900 

tatcagcajsig taaaaaccaa ggtcttccag ctgaagagga aggagtatga aatgaaactt 960 

cacactgagg cccagatcca agaagaagga acagtggtgg aattgactgg aaggcagtcc 1020 

agtgaaatca caagaaccat aaccaaactc tcatttgtga aagtggactc acactttcga 1080 

cagggaattc ccttctttgg gcaggtgcgc ctagtagatg ggaaaggcgt ccctatacca 1140 

aataaagtca tattcatcag aggaaatgaa gcaaactatt actccaatgc taccacggat 1200 

gagcatggcc ttgtacagtt ctctatcaac accaccaacg ttatgggtac ctctcttact 1260 

gttagggtca attacaagga tcgtagtccc tgttacggct accagtgggt gtcagaagaa 1320 

cacgaagagg cacatcacac tgcttatctt gtgttctccc caagcaagag ctttgtccac 1380 

cttgagccca tgtctcatga actaccctgt ggccatactc agacagtcca ggcacattat 1440 

attctgaatg gaggcaccct gctggggctg aagaagctct ccttttatta tctgataatg 1500 

gcaaagggag gcattgtccg aactgggact catggactgc ttgtgaagca ggaagacatg 1560 

aagggccatt tttccatctc aatccctgtg aagtcagaca ttgctcctgt cgctcggttg 1620 

ctcatctatg ctgttttacc taccggggac gtgattgggg attctgcaaa atatgatgtt 1680 

gaaaattgtc tggccaacaa ggtggatttg agcttcagcc catcacaaag tctcccagcc 1740 

tcacacgccc acctgcgagt cacagcggct cctcagtccg tctgcgccct ccgtgctgtg 1800 

gaccaaagcg tgctgctcat gaagcctgat gctgagctct cggcgtcctc ggtttacaac 1860 

ctgctaccag aaaaggacct cactggcttc cctgggcctt tgaatgacca ggacgatgaa 1920 

gactgcatca atcgtcataa tgtctatatt aatggaatca catatactcc agtatcaagt 1980 

acaaatgaaa aggatatgta cagcttccta gaggacatgg gcttaaaggc attcaccaac 2040 

tcaaagattc gtaaacccaa aatgtgtcca cagcttcaac agtatgaaat gcatggacct 2100 
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gaaggtctac gtgtaggttt ttatgagtca gatgtaatgg gaagaggcca tgcacgcctg 2160 

gtgcatgttg aagagcctca cacggagacc gtacgaaagt acttccctga gacatggatc 2220 

tgggatttgg tggtggtaaa ctcagcaggg gtggctgagg taggagtaac agtccctgac 2280 

accatcaccg agtggaaggc aggggccttc tgcctgtctg aagatgctgg acttggtatc 2340 

tcttccactg cctctctccg agccttccag cccttctttg tggagcttac aatgccttac 2400 

tctgtgattc gtggagaggc cttcacactc aaggccacgg tcctaaacta ccttcccaaa 2460 

tgcatccggg tcagtgtgca gctggaagcc tctcccgcct tccttgctgt cccagtggag 2520 

aaggaacaag cgcctcactg catctgtgca aacgggcggc aaactgtgtc ctgggcagta 2580 

accccaaagt cattaggaaa tgtgaatttc actgtgagcg cagaggcact agagtctcaa 2640 

gagctgtgtg ggactgaggt gccttcagtt cctgaacacg gaaggaaaga cacagtcatc 2700 

aagcctctgt tggttgaacc tgaaggacta gagaaggaaa caacattcaa ctccctactt 2760 

tgtccatcag gtggtgaggt ttctgaagaa ttatccctga aactgccacc aaatgtggta 2820 

gaagaatctg cccgagcttc tgtctcagtt ttgggagaca tattaggctc tgccatgcaa 2880 

aacacacaaa atcttctcca gatgccctat ggctgtggag agcagaatat ggtcctcttt 2940 

gctcctaaca tctatgtact ggattatcta aatgaaacac agcagcttac tccagaggtc 3000 

aagtccaagg ccattggcta tctcaacact ggttaccaga gacagttgaa ctacaaacac 3060 

tatgatggct cctacagcac ctttggggag cgatatggca ggaaccaggg caacacctgg 3120 

ctcacagcct ttgttctgaa gacttttgcc caagctcgag cctacatctt catcgatgaa 3180 

gcacacatta cccaagccct catatggctc tcccagaggc agaaggacaa tggctgtttc 3240 

aggagctctg ggtcactgct caacaatgcc ataaagggag gagtagaaga tgaagtgacc 3300 

ctctccgcct atatcaccat cgcccttctg gagattcctc tcacagtcac tcaccctgtt 3360 

gtccgcaatg ccctgttttg cctggagtca gcctggaaga cagcacaaga aggggaccat 3420 

ggcagccatg tatataccaa agcactgctg gcctatgctt ttgccctggc aggtaaccag 3480 

gacaagagga aggaagtact caagtcactt aatgaggaag ctgtgaagaa agacaactct 3540 

gtccattggg agcgccctca gaaacccaag gcaccagtgg ggcattttta cgaaccccag 3600 

gctccctctg ctgaggtgga gatgacatcc tatgtgctcc tcgcttatct cacggcccag 3660 

ccagccccaa cctcggagga cctgacctct gcaaccaaca tcgtgaagtg gatcacgaag 3720 

cagcagaatg cccagggcgg tttctcctcc acccaggaca • cagtggtggc tctccatgct 3780 

ctgtccaaat atggagccgc cacatttacc aggactggga aggctgcaca ggtgactatc 3840 

cagtcttcag ggacattttc cagcaaattc caagtggaca acaacaatcg cctgttactg 3900 

cagcaggtct cattgccaga gctgcctggg gaatacagca tgaaagtgac aggagaagga 3960 

tgtgtctacc tccagacctc cttgaaatac aatattctcc cagaaaagga agagttcccc 4020 

tttgctttag gagtgcagac tctgcctcaa acttgtgatg aacccaaagc ccacaccagc 4080 

ttccaaakct ccctaagtgt cagttacaca gggagccgct ctgcctccaa catggcgatc 4140 

gttgatgtga agatggtctc tggcttcatt cccctgaagc caacagtgaa aatgcttgaa 4200 

agatctaacc atgtgagccg gacagaagtc agcagcaacc atgtcttgat ttaccttgat 4260 

aaggtgtcaa atcagacact gagcttgttc ttcacggttc tgcaagatgt cccagtaaga 4320 

gatctcaaac cagccatagt gaaagtctat gattactacg agacggatga gtttgcaatc 4380 

gctgagtaca atgctccttg cagcaaagat cttggaaatg ct 4422 

<210> 5 
<211> 1474 
<212> PRT 

<213> Homo sapiens 



<400> 5 



Met 


Qly Lys 


Asn 


Lys 


Leu Leu 


His 


Pro 


Ser 


Leu 


Val Leu Leu Leu Leu 


1 






5 








10 




15 


Val 


Leu Leu 


Pro 


Thr 


Asp Ala 


Ser 


Val 


Ser 


Gly 


Lys Pro Gin Tyr Met 






20 








25 






30 


Val 


Leu Val 


Pro 


Ser 


Leu Leu 


His 


Thr 


Glu 


Thr 


Thr Glu Lys Gly Cys 




35 








40 








45 


Val 


Leu Leu 


Ser 


Tyr 


Leu Asn 


Glu 


Thr 


Val 


Thr 


Val Ser Ala Ser Leu 




50 






55 










60 


Glu 


Ser Val 


Arg 


Gly 


Asn Arg 


Ser 


Leu 


Phe 


Thr 


Asp Leu Glu Ala Glu 


65 








70 








75 


80 


Asn 


Asp Val 


Leu 


His 


Cys Val 


Ala 


Phe 


Ala 


Val 


Pro Lys Ser Ser Ser 








85 








90 




95 
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Asn 


Glu 


Glu 


Val 


Met 


Phe 


Leu 


Thr 


Val 


Gin 


Val 


Lys 


Gly 


Pro 


Thr 


Gin 








100 










105 










110 






Glu 


Phe 


Lys 


Lys 


Arg 


Thr 


Thr 


Val 


Met 


Val 


Lys 


Asn 


Glu 


Asp 


Ser 


Leu 






115 










120 










125 








Val 


Phe 


Val 


Qln 


Thr 


Asp 


Lys 


Ser 


He 


Tyr 


Lys 


Pro 


Gly 


Gin 


Thr 


Val 




130 










135 










140 










Lys 


Phe 


Arg 


Val 


Val 


Ser 


Met 


Asp 


Glu 


Asn 


Phe 


His 


Pro 


Leu 


Asn 


Glu 


145 










150 










155 










160 


Leu 


lie 


Pro 


Leu 


Val 


Tyr 


lie 


Gin 


Asp 


Pro 


Lys 

mm J mm 


Gly 


Asn 


Atq 


He 


Ala 










165 










170 










175 




Gin 


Trp 


Gin 


Ser 


Phe 


Gin 


Leu 


Glu 


Gly 


Gly 


Leu 


Lvs 


Gin 


Phe 


Ser 


Phe 








180 










185 










190 






Pro 


Leu 


Ser 


Ser 


•Glu 


Pro 


Phe 


Gin 


Gly 


Ser 


Tvr 


Lvs 


Val 


Val 


Val 


Qln 






195 










200 










205 








Lys 


LVS 


Ser 


Gly 


Gly 


Arq 


Thr 


Glu 


His 


Pro 


Phe 


Thr 


Val 


Glu 


Glu 


Phe 




210 










215 










220 










Val 


Leu 


Pro 


Lvs 


Phe 


Glu 


Val 


Qln 


Val 


Thr 


Val 


Pro 


Lvs 


He 


He 


Thr 


225 










230 










235 










240 


lie 


Leu 


Glu 


Glu 


Glu 


Met 


Asn 


Val 


Ser 


Val 


Cvs 


Glv 


Leu 


Tvr 


Thr 


Tvr 










245 










250 










255 




Gly 


Lys 


Pro 


Val 


Pro 


Gly 


His 


Val 


Thr 


Val 


Ser 


He 


Cvs 


Ara 


Lvs 


Tvr 








260 










265 










270 






Ser 


Asp 


Ala 


Ser 


Asp 


Cvs 


His 


Gly 


Glu 


Asp 


Ser 


Gin 


Ala 


Phe 


Cvs 


Glu 






275 










280 










285 








Lys 


Phe 


Ser 


Gly 


Gin 


Leu 


Asn 


Ser 


His 


Glv 


Cvs 


Phe 


Tvr 


Gin 


Gin 


Val 




290 










295 










300 










Lve 


Thr 


Lvs 


Val 


Phe 


Gin 


Leu 


Lvs 


Aro 


Lys 


Glu 


Tvr 


Glu 


Met 


Lys 


Leu 


305 










310 










315 










320 


His 


Thr 


Glu 


Ala 


Gin 


He 


Gin 


Glu 


Glu 


Glv 


Thr 


Val 


Val 


Glu 


Leu 


Thr 










325 










330 










335 




Gly 




Gin 


Ser 


Ser 


Glu 


He 


Thr 


Ara 


Thr 


He 


Thr 


Lys 


Leu 


Ser 


Phe 








340 










345 










350 






Val 


Lys 


Val 


ASD 


Ser 


Hia 


Phe 


Arcr 


Gin 


Glv 


He 


Pro 


Phe 


Phe 


Qlv 


Gin 






355 










360 










365 








Val 




Leu 


Val 


Asp 


Gly 


Lvs 


Glv 


Val 


Pro 


He 


Pro 


Asn 


Lys 


Val 


He 




370 










375 










380 










Phe 


lie 


Ara 


Gly 


Asn 


Glu 


Ala 


Asn 


Tvr 


Tvr 


Ser 


Asn 


Ala 


Thr 


Thr 


Asp 


385 










390 










395 










400 


Glu 


His 


Gly 


Leu 


Val 


Gin 


Phe 


Ser 


He 


Asn 


Thr 


Thr 


Asn 


Val 


Met 


Glv 










405 










410 










415 




Thr 


Ser 


Leu 


Thr 


Val 


Ara 


Val 


Asn 


Tvr 


Lvs 


Asp 


Aro 


Ser 


Pro 


CVR 


Tvr 








420 










425 










430 






Gly 


Tyr 


Gin 


Trp 


Val 


Ser 


Glu 


Glu 


His 


Glu 


Glu 


Ala 


His 


His 


Thr 


Ala 






435 










440 










445 








Tyr 


Leu 


Val 


Phe 


Ser 


Pro 


Ser 


Lvs 


Ser 


Phe 


Val 


His 


Leu 


Glu 


Pro 


Met 




450 










455 










460 










Ser 


His 


Glu 


Leu 


Pro 


Cvs 


Gly 


His 


Thr 


Gin 


Thr 


Val 


Gin 


Ala 


His 


Tvr 


465 










470 










475 










480 


lie 


Leu 


Asn 


Qlv 


Glv 


Thr 


Leu 


Leu 


Glv 


Leu 


Lys 


Lys 


Leu 


Ser 


Phe 


Tvr 










485 










490 










495 




Tvr 


Leu 


lie 


Met 


Ala 


Lys 


Glv 


Qlv 


He 


Val 




Thr 


Gly 


Thr 


His 


Glv 








500 










505 










510 






Leu 


Leu 


Val 


Lys 


Gin 


Glu 


Asp 


Met 


Lys 


Gly 


His 


Phe 


Ser 


He 


Ser 


He 






515 










520 










525 








Pro 


Val 


Lys 


Ser 


Asp 


He 


Ala 


Pro 


Val 


Ala 


Arg 


Leu 


Leu 


He 


Tyr 


Ala 




530 










535 










540 










Val 


Leu 


Pro 


Thr 


Gly 


Asp, 


Val 


He 


Gly 


Asp 


Ser 


Ala 


Lys 


Tyr 


Asp 


Val 


545 










550 










555 










560 
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Qlu 


Asn 


C?ys 


Leu 


Ala 


Asn 


Lys 


Val 


Asp 


Leu 


Ser 


Phe 


Ser 


Pro 


Ser 


Gin 










565 










570 










575 




Ser 


Leu 


Pro 


Ala 


Ser 


His 


Ala 


His 


Leu 


Arg 


Val 


Thr 


Ala 


Ala 


Pro 


Gin 








580 










585 










590 






Ser 


Val 


Cys 


Ala 


Leu 


Arg 


Ala 


Val 


Asp 


Gin 


Ser 


Val 


Leu 


Leu 


Met 


Lvs 






595 










600 










605 






Pro 


Asp 


Ala 


Glu 


Leu 


Ser 


Ala 


Ser 


Ser 


Val 


Tyr 


Asn 


Leu 


Leu 


Pro 


Glu 




610 










615 










620 










Lys 


Asp 


Leu 


Thr 


Gly 


Phe 


Pro 


Gly 


Pro 


Leu 


Asn 


Asp 


Gin 


Asp 


Asp 


Glu 


625 










630 










635 










640 


Asp 


Cvs 


lie 


Asn 


Arq 


His 


Asn 


Val 


Tvr 

JT ^ 


He 


Asn 


Glv 


He 


Thr 


Tvr 


Thr 










645 










650 










655 




Pro 


Val 


Ser 


Ser 


Thr 


Asn 


Glu 


Lvs 


ASD 


Met 


Tvr 


Ser 


Phe 


Leu 


Glu 


Asp 








660 










665 










670 






Met 


Gly 


Leu 


Lys 


Ala 


Phe 


Thr 


Asn 


Ser 


Lvs 


He 


Arq 


Lvs 


Pro 


Lvs 


Met 






675 










660 










685 








Cys 


Pro 


Gin 


Leu 


Gin 


Gin 


Tyr 


Glu 


Met 


His 


Gly 


Pro 


Glu 


Glv 


Leu 


Ara 




690 










695 










700 










Val 


Gly 


Phe 


Tvr 


Glu 


Ser 


Asp 


Val 


Met 


Glv 

VIA J 


ArQ 


Glv 


His 


Ala 


Ara 


Leu 


705 










710 










715 










720 


Val 


His 


Val 


Glu 


Glu 


Pro 


His 


Thr 


Glu 


Thr 


Val 


Ara 


Lys 


Tvr 


Phe 


Pro 










725 










730 










735 




Qlu 


Thr 


Tro 


He 


Tro 


Aso 


Leu 


Val 


Val 


Val 


Asn 


Ser 


Ala 


Glv 


Val 


Ala 








740 










745 










750 






Glu 


Val 


Gly 


Val 


Thr 


Val 


Pro 


Asp 


Thr 


He 


Thr 


Glu 


Tro 


Lvs 


Ala 


Glv 






755 










760 










765 








Ala 


Phe 


Cys 


Leu 


Ser 


Glu 


Asp 


Ala 


Gly 


Leu 


Glv 


He 


Ser 


Ser 


Thr 


Ala 




770 










775 










780 










Ser 


Leu 


Arq 


Ala 


Phe 


Gin 


Pro 


Phe 


Phe 


Val 


Glu 


Leu 


Thr 


Met 


Pro 




785 










790 










795 










800 


Ser 


Val 


lie 


Ara 


Glv 


Glu 


Ala 


Phe 


Thr 


Leu 


Lys 


Ala 


Thr 


Val 


Leu 


Asn 










805 










810 










815 




Tvr 


Leu 


Pro 


Lvs 


Cvs 


He 


Arcf 


Val 


Ser 


Val 


Gin 


Leu 


Glu 


Ala 


Ser 










820 










825 










830 






Ala 


Phe 


Leu 


Ala 


Val 


Pro 


Val 


Glu 


Lvs 


Glu 


Gin 


Ala 


Pro 


His 


Cys 


He 






835 










840 










645 








Cys 


Ala 


Asn 


Gly 


Ara 


Gin 


Thr 


Val 


Ser 


Tro 


Ala 


Val 


Thr 


Pro 


Lys 


Ser 




850 










855 










860 










Leu 


Gly 


Asn 


Val 


Asn 


Phe 


Thr 


Val 


Ser 


Ala 


Glu 


Ala 


Leu 


Glu 


Ser 


Gin 


865 










870 










875 










880 


Glu 


Leu 


Cys 


Gly 


Thr 


Glu 


Val 


Pro 


Ser 


Val 


Pro 


Glu 


His 


Glv 


~-y 


Lys 










885 










890 










895 




Asp 


Thr 


Val 


He 


Lys 


Pro 


Leu 


Leu 


Val 


Glu 


Pro 


Glu 


Glv 


Leu 


Glu 


Lys 








900 










90S 










910 






Glu 


Thr 


Thr 


Phe 


Asn 


Ser 


Leu 


Leu 


Cys 


Pro 


Ser 


Glv 


Glv 


Glu 


Val 


Ser 






915 










920 










925 








Glu 


Glu 


Leu 


Ser 


Leu 


Lvs 


Leu 


Pro 


Pro 


Asn 


Val 


Val 


Glu 


Qlu 


Ser 


Ala 




930 










935 










940 










Arg 


Ala 


Ser 


Val 


Ser 


Val 


Leu 


Gly 


Asp 


He 


Leu 


Glv 


Ser 


Ala 


Met 


Gin 


945 










950 










955 










960 


Asn 


Thr 


Gin 


Asn 


Leu 


Leu 


Gin 


Met 


Pro 


Tyr 


Gly 


Cys 


Gly 


Qlu 


Gin 


Asn 










965 










970 










975 




Met 


Val 


Leu 


Phe 


Ala 


Pro 


Asn 


He 


Tyr 


Val 


Leu 


Asp 


Tyr 


Leu 


Asn 


Glu 








980 










985 










990 






Thr 


Gin 


Gin 


Leu 


Thr 


Pro 


Glu 


Val 


Lys 


Ser 


Lys 


Ala 


He 


Gly 


Tyr 


Leu 






995 










1000 








1005 






Asn 


Thr 


Gly 


Tyr 


Gin 


Arg 


Gin 


Leu 


Asn 


Tyr 


Lys 


His 


Tyr Asp 


Gly 


Ser 



1010 1015 1020 
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Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr lie 

1045 1050 1055 

Phe lie Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 

1060 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 

1075 1080 1085 

Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 

1090 1095 1100 

He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 
1105 1110 1115 1120 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 

1125 1130 1135 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 

1140 1145 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 

1155 ' 1160 1165 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 

1170 1175 1180 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 
1185 1190 1195 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 

1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 

1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 

1235 1240 1245 

Ser Ser Thr Gin Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 

1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 

1265 1290 1295 

Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 

1300 1305 1310 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 

1315 1320 1325 

Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 

1330 1335 1340 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 . 1360 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 

1365 1370 1375 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu 

1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 

1395 1400 1405 

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 

1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 

1445 1450 1455 

Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 1470 

Asn Ala 
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<210> 6 
<211> 14896 
<212> DMA 

<213> Homo sapiens 
<400> 6 

cagcggtgcg agctccaggc ccatgcactg aggaggcgga aacaagggga gcccccagag 60 

ctccatcaag ccccctccaa aggctcccct acccggtcca cgccccccac cccccctccc 120 

cgcctcctcc caattgtgca tttttgcagc cggaggcggc tccgagatgg ggctgtgagc 180 

ttcgcccggg gagggggaaa gagcagcgag gagtgaagcg ggggggtggg gtgaagggtt 240 

tggatttcgg ggcagggggc gcacccccgt cagcaggccc tccccaaggg gctcggaact 300 

ctacctcttc acccacgccc ctggtgcgct ttgccgaagg aaagaataag aacagagaag 360 

gaggaggggg aaaggaggaa aagggggacc ccccaactgg ggggggtgaa ggagagaagt 420 

agcaggacca gaggggaagg ggctgctgct tgcatcagcc cacaccatgc tgaccccgcc 480 

gttgctcctg ctgctgcccc tgctctcagc tctggtcgcg gcggctatcg acgcccctaa 540 

gacttgcagc cccaagcagt ttgcctgcag agatcaaata acctgtatct caaagggctg 600 

gcggtgcgac ggtgagaggg actgcccaga cggatctgac gaggcccctg agatttgtcc 660 

acagagtaag gcccagcgat gccagccaaa cgagcataac tgcctgggta ctgagctgtg 720 

tgttcccatg tcccgcctct gcaatggggt ccaggactgc atggacggct cagatgaggg 780 

gccccactgc cgagagctcc aaggcaactg ctctcgcctg ggctgccagc accattgtgt 840 

ccccacactc gatgggccca cctgctactg caacagcagc tttcagcttc aggcagatgg 900 

caagacctgc aaagattttg atgagtgctc agtgtacggc acctgcagcc agctatgcac 960 

caacacagac ggctccttca tatgtggctg tgttgaagga tacctcctgc agccggataa 1020 

ccgctcctgc aaggccaaga acgagccagt agaccggccc cctgtgctgt tgatagccaa 1080 

ctcccagaac atcttggcca cgtacctgag tggggcccag gtgtctacca tcacacctac 1140 

gagcacgcgg cagaccacag ccatggactt cagctatgcc aacgagaccg tatgctgggt 1200 

gcatgttggg gacagtgctg ctcagacgca gctcaagtgt gcccgcatgc ctggcctaaa 1260 

gggcttcgtg gatgagcaca ccatcaacat ctccctcagt ctgcaccacg tggaacagat 1320 

ggccatcgac tggctgacag gcaacttcta ctttgtggat gacatcgatg ataggatctt 1380 

tgtctgcaac agaaatgggg acacatgtgt cacattgcta gacctggaac tctacaaccc 1440 

caagggcatt gccctggacc ctgccatggg gaaggtgttt ttcactgact atgggcagat 1500 

cccaaaggtg gaacgctgtg acatggatgg gcagaaccgc accaagctcg tcgacagcaa 1560 

gattgtgttt cctcatggca tcacgctgga cctggtcagc cgccttgtct actgggcaga 1620 

tgcctatctg gactatattg aagtggtgga ctatgagggc aagggccgcc agaccatcat 1680 

ccagggcatc ctgattgagc acctgtacgg cctgactgtg tttgagaatt atctctatgc 1740 

caccaactcg gacaatgcca atgcccagca gaagacgagt gtgatccgtg tgaaccgctt 1800 

taacagcacc gagtaccagg ttgtcacccg ggtggacaag ggtggtgccc tccacatcta I860 

ccaccagagg cgtcagcccc gagtgaggag ccatgcctgt gaaaacgacc agtatgggaa 1920 

gccgggtggc tgctctgaca tctgcctgct ggccaacagc cacaaggcgc ggacctgccg 1980 

ctgccgttcc ggcttcagcc tgggcagtga cgggaagtca tgcaagaagc cggagcatga 2040 

gctgttcctc gtgtatggca agggccggcc aggcatcatc cggggcatgg atatgggggc 2100 

caaggtcccg gatgagcaca tgatccccat tgaaaacctc atgaaccccc gagccctgga 2160 

cttccacgct gagaccggct tcatctactt tgccgacacc accagctacc tcattggccg 2220 

ccagaagatt gatggcactg agcgggagac catcctgaag gacggcatcc acaatgtgga 2280 

gggtgtggcc gtggactgga tgggagacaa tctgtactgg acggacgatg ggcccaaaaa 2340 

gacaatcagc gtggccaggc tggagaaagc tgctcagacc cgcaagactt taatcgaggg 2400 

caaaatgaca caccccaggg ctattgtggt ggatccactc aatgggtgga tgtactggac 2460 

agactgggag gaggacccca aggacagtcg gcgtgggcgg ctggagaggg cgtggatgga 2520 

tggctcacac cgagacatct ttgtcacctc caagacagtg ctttggccca atgggctaag 2580 

cctggacatc ccggctgggc gcctctactg ggtggatgcc ttctacgacc gcatcgagac 2640 

gatactgctc aatggcacag accggaagat tgtgtatgaa ggtcctgagc tgaaccacgc 2700 

ctttggcctg tgtcaccatg gcaactacct cttctggact gagtatcgga gtggcagtgt 2760 

ctaccgcttg gaacggggtg taggaggcgc accccccact gtgacccttc tgcgcagtga 2820 

gcggcccccc atctttgaga tccgaatgta tgatgcccag cagcagcaag ttggcaccaa 2880 

caaatgccgg gtgaacaatg gcggctgcag cagcctgtgc ttggccaccc ctgggagccg 2940 

ccagtgcgcc tgtgctgagg accaggtgtt ggacgcagac ggcgtcactt gcttggcgaa 3000 

cccatcctac gtgcctccac cccagtgcca gccaggcgag tttgcctgtg ccaacagccg 3060 

ctgcatccag gagcgctgga agtgtgacgg agacaacgat tgcctggaca acagtgatga 3120 
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ggccccagcc ctctgccatc agcacacctg cccctcggac cgattcaagt gcgagaacaa 3180 

ccggtgcatc cccaaccgct ggctctgcga cggggacaat gactgtggga acagtgaaga 3240 

tgagtccaat gccacttgtt cagcccgcac ctgccccccc aaccagttct cctgtgccag 3300 

tggccgctgc atccccatct cctggacgtg tgatctggat gacgactgtg gggaccgctc 3360 

tgatgagtct gcttcgtgtg cctatcccac ctgcttcccc ctgactcagt ttacctgcaa 3420 

caatggcaga tgtatcaaca tcaactggag atgcgacaat gacaatgact gtggggacaa 3480 

cagtgacgaa gccggctgca gccactcctg ttctagcacc cagttcaagt gcaacagcgg 3540 

gcgttgcatc cccgagcact ggacctgcga tggggacaat gactgcggag actacagtga 3600 

tgagacacac gccaactgca ccaaccaggc cacgaggccc cctggtggct gccacactga 3660 

tgagttccag tgccggctgg atggactatg catccccctg cggtggcgct gcgatgggga 3720 

cactgactgc atggactcca gcgatgagaa gagctgtgag ggagtgaccc acgtctgcga 3780 

tcccagtgtc aagtttggct gcaaggactc agctcggtgc atcagcaaag cgtgggtgtg 3840 

tgatggcgac aatgactgtg aggataactc ggacgaggag aactgcgagt ccctggcctg 3900 

caggccaccc tcgcaccctt gtgccaacaa cacctcagtc tgcctgcccc ctgacaagct 3960 

gtgtgatggc aacgacgact gtggcgacgg ctcagatgag ggcgagctct gcgaccagtg " 4020 

ctctctgaat aacggtggct gcagccacaa ctgctcagtg gcacctggcg aaggcattgt 4080 

gtgttcctgc cctctgggca tggagctggg gcccgacaac cacacctgcc agatccagag 4140 

ctactgtgcc aagcatctca aatgcagcca aaagtgcgac cagaacaagt tcagcgtgaa 4200 

gtgctcctgc tacgagggct gggtcctgga acctgacggc gagagctgcc gcagcctgga 4260 

ccccttcaag ccgttcatca ttttctccaa ccgccatgaa atccggcgca tcgatcttca 4320 

caaaggagac tacagcgtcc tggtgcccgg cctgcgcaac accatcgccc tggacttcca 4380 

cctcagccag agcgccctct actggaccga cgtggtggag gacaagatct accgcgggaa 4440 

gctgctggac aacggagccc tgactagttt cgaggtggtg attcagtatg gcctggccac 4500 

acccgagggc ctggctgtag actggattgc aggcaacatc tactgggtgg agagtaacct 4560 

ggatcagatc gaggtggcca agctggatgg gaccctccgg accaccctgc tggccggtga 4620 

cattgagcac ccaagggcaa tcgcactgga tccccgggat gggatcctgt tttggacaga 4680 

ctgggatgcc agcctgcccc gcattgaggc agcctccatg agtggggctg ggcgccgcac 4740 

cgtgcaccgg gagaccggct ctgggggctg gcccaacggg ctcaccgtgg actacctgga 4800 

gaagcgcatc ■ ctttggattg acgccaggtc agatgccatt tactcagccc gttacgacgg 4860 

ctctggccac atggaggtgc ttcggggaca cgagttcctg tcgcacccgt ttgcagtgac 4920 

gctgtacggg ggggaggtct actggactga ctggcgaaca aacacactgg ctaaggccaa 4980 

caagtggacc ggccacaatg tcaccgtggt acagaggacc aacacccagc cctttgacct 5040 

gcaggtgtac cacccctccc gccagcccat ggctcccaat ccctgtgagg ccaatggggg 5100 

ccagggcccc tgctcccacc tgtgtctcat caactacaac cggaccgtgt cctgcgcctg 5160 

cccccacctc atgaagctcc acaaggacaa caccacctgc tatgagttta agaagttcct 5220 

gctgtacgca cgtcagatgg agatccgagg tgtggacctg gatgctccct actacaacta 5280 

catcatctcc ttcacggtgc ccgacatcga caacgtcaca gtgctagact acgatgcccg 5340 

cgagcagcgt gtgtactggt ctgacgtgcg gacacaggcc atcaagcggg ccttcatcaa 5400 

cggcacaggc gtggagacag tcgtctctgc agacttgcca aatgcccacg ggctggctgt 5460 

ggactgggtc tcccgaaacc tgttctggac aagctatgac accaataaga agcagatcaa 5520 

tgtggcccgg ctggatggct ccttcaagaa cgcagtggtg cagggcctgg agcagcccca 5580 

tggccttgtc gtccaccctc tgcgtgggaa gctctactgg accgatggtg acaacatcag 5640 

catggccaac atggatggca gcaatcgcac cctgctcttc agtggccaga agggccccgt 5700 

gggcctggct attgacttcc ctgaaagcaa actctactgg atcagctccg ggaaccatac 5760 

catcaaccgc tgcaacctgg atgggagtgg gctggaggtc atcgatgcca tgcggagcca 5820 

gctgggcaag gccaccgccc tggccatcat gggggacaag ctgtggtggg ctgatcaggt 5880 

gtcggaaaag atgggcacat gcagcaaggc tgacggctcg ggctccgtgg tccttcggaa 5940 

cagcaccacc ctggtgatgc acatgaaggt ctatgacgag agcatccagc tggaccataa 6000 

gggcaccaac ccctgcagtg tcaacaacgg tgactgctcc cagctctgcc tgcccacgtc 6060 

agagacgacc cgctcctgca tgtgcacagc cggctatagc ctccggagtg gccagcaggc 6120 

ctgcgagggc gtaggttcct ttctcctgta ctctgtgcat gagggaatca ggggaattcc 6180 

cctggatccc aatgacaagt cagatgccct ggtcccagtg tccgggacct cgctggctgt 6240 

cggcatcgac ttccacgctg aaaatgacac catctactgg gtggacatgg gcctgagcac 6300 

gatcagccgg gccaagcggg accagacgtg gcgtgaagac gtggtgacca atggcattgg 6360 

ccgtgtggag ggcattgcag tggactggat cgcaggcaac atctactgga cagaccaggg 6420 

ctttgatgtc atcgaggtcg cccggctcaa tggctccttc cgctacgtgg tgatctccca 6480 

gggtctagac aagccccggg ccatcaccgt ccacccggag aaagggtact tgttctggac 6540 

tgagtggggt cagtatccgc gtattgagcg gtctcggcta gatggcacgg agcgtgtggt 6600 
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gctggtcaac gtcagcatca gctggcccaa cggcatctca gtggactacc aggatgggaa 6660 

gctgtactgg tgcgatgcac ggacagacaa gattgaacgg atcgacctgg agacaggtga 6720 

gaaccgcgag gtggttctgt ccagcaacaa catggacatg ttttcagtgt ctgtgtttga 6780 

ggatttcatc tactggagtg acaggactca tgccaacggc tctatcaagc gcgggagcaa 6840 

agacaatgcc acagactccg tgcccctgcg aaccggcatc ggcgtccagc ttaaagacat 6900 

caaagtcttc aaccgggacc ggcagaaagg caccaacgtg tgcgcggtgg ccaatggcgg 6960 

gtgccagcag ctgtgcctgt accggggccg tgggcagcgg gcctgcgcct gtgcccacgg 7020 

gatgctggct gaagacggag catcgtgccg cgagtatgcc ggctacctgc tctactcaga 7080 

gcgcaccatt ctcaagagta tccacctgtc ggatgagcgc aacctcaatg cgcccgtgca 7140 

gcccttcgag gaccctgagc acatgaagaa cgtcatcgcc ctggcctttg actaccgggc 7200 

aggcacctct ccgggcaccc ccaatcgcat cttcttcagc gacatccact ttgggaacat 7260 

ccaacagatc aacgacgatg gctccaggag gatcaccatt gtggaaaacg tgggctccgt 7320 

ggaaggcctg gcctatcacc gtggctggga cactctctat tggacaagct acacgacatc 7380 

caccatcacg cgccacacag tggaccagac ccgcccaggg gccttcgagc gtgagaccgt 7440 

catcactatg tctggagatg accacccacg ggccttcgtt ttggacgagt gccagaacct 7500 

catgttctgg accaactgga atgagcagca tcccagcatc atgcgggcgg cgctctcggg 7560 

agccaatgtc ctgaccctta tcgagaagga catccgtacc cccaatggcc tggccatcga 7620 

ccaccgtgcc gagaagctct acttctctga cgccaccctg gacaagatcg agcggtgcga 7680 

gtatgacggc tcccaccgct atgtgatcct aaagtcagag cctgtccacc ccttcgggct 7740 

ggccgtgtat ggggagcaca ttttctggac tgactgggtg cggcgggcag tgcagcgggc 7800 

caacaagcac gtgggcagca acatgaagct gctgcgcgtg gacatccccc agcagcccat 7860 

gggcatcatc gccgtggcca acgacaccaa cagctgtgaa ctctctccat gccgaatcaa 7920 

caacggtggc tgccaggacc tgtgtctgct cactcaccag ggccatgtca actgctcatg 7980 

ccgagggggc cgaatcctcc aggatgacct cacctgccga gcggtgaatt cctcttgccg 8040 

agcacaagat gagtttgagt gtgccaatgg cgagtgcatc aacttcagcc tgacctgcga 8100 

cggcgtcccc cactgcaagg acaagtccga tgagaagcca tcctactgca actcccgccg 8160 

ctgcaagaag actttccggc agtgcagcaa tgggcgctgt gtgtccaaca tgctgtggtg 8220 

caacggggcc gacgactgtg gggatggctc tgacgagatc ccttgcaaca agacagcctg 8280 

tggtgtgggc gagttccgct gccgggacgg gacctgcatc gggaactcca gccgctgcaa 8340 

ccagtttgtg gattgtgagg acgcctcaga tgagatgaac tgcagtgcca ccgactgcag 8400 

cagctacttc cgcctgggcg tgaagggcgt gctcttccag ccctgcgagc ggacctcact 8460 

ctgctacgca cccagctggg tgtgtgatgg cgccaatgac tgtggggact acagtgatga 8520 

gcgcgactgc ccaggtgtga aacgccccag atgccctctg aattacttcg cctgccctag 8580 

tgggcgctgc atccccatga gctggacgtg tgacaaagag gatgactgtg aacatggcga 8640 

ggacgagacc cactgcaaca agttctgctc agaggcccag tttgagtgcc agaaccatcg 8700 

ctgcatctcc aagcagtggc tgtgtgacgg cagcgatgac tgtggggatg gctcagacga 8760 

ggctgctcac tgtgaaggca agacgtgcgg cccctcctcc ttctcctgcc ctggcaccca 8820 

cgtgtgcgtc cccgagcgct ggctctgtga cggtgacaaa gactgtgctg atggtgcaga 8880 

cgagagcatc gcagctggtt gcttgtacaa cagcacttgt gacgaccgtg agttcatgtg 8940 

ccagaaccgc cagtgcatcc ccaagcactt cgtgtgtgac cacgaccgtg actgtgcaga 9000 

tggctctgat gagtcccccg agtgtgagta cccgacctgc ggccccagtg agttccgctg 9060 

tgccaatggg cgctgtctga gctcccgcca gtgggagtgt gatggcgaga atgactgcca 9120 

cgaccagagt gacgaggctc ccaagaaccc acactgcacc agcccagagc acaagtgcaa 9180 

tgcctcgtca cagttcctgt gcagcagtgg gcgctgtgtg gctgaggcac tgctctgcaa 9240 

cggccaggat gactgtggcg acagctcgga cgagcgtggc tgccacatca atgagtgtct 9300 

cagccgcaag ctcagtggct gcagccagga ctgtgaggac ctcaagatcg gcttcaagtg 9360 

ccgctgtcgc cctggcttcc ggctgaagga tgacggccgg acgtgtgctg atgtggacga 9420 

gtgcagcacc accttcccct gcagccagcg ctgcatcaac acccatggca gctataagtg 9480 

tctgtgtgtg gagggctatg caccccgcgg cggcgacccc cacagctgca aggctgtgac 9540 

tgacgaggaa ccgtttctga tcttcgccaa ccggtactac ct'gcgcaagc tcaacctgga 9600 

cgggtccaac tacacgttac ttaagcaggg cctgaacaac. gccgttgcct tggattttga 9660 

ctaccgagag cagatgatct actggacaga tgtgaccacc cagggcagca tgatccgaag 9720 

gatgcacctt aacgggagca atgtgcaggt cctacaccgt acaggcctca gcaaccccga 9780 

tgggctggct gtggactggg tgggtggcaa cctgtactgg tgcgacaaag gccgggacac 9840 

catcgaggtg tccaagctca atggggccta tcggacggtg ctggtcagct ctggcctccg 9900 

tgagcccagg gctctggtgg tggatgtgca gaatgggtac ctgtactgga cagactgggg 9960 

tgaccattca ctgatcggcc gcatcggcat ggatgggtcc agccgcagcg tcatcgtgga 10020 

caccaagatc acatggccca atggcctgac gctggactat gtcactgagc gcatctactg 10080 
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ggccgacgcc cgcgaggact acattgaatt tgccagcctg gatggctcca atcgccacgt 10140 

tgtgctgagc caggacatcc cgcacatctt tgcactgacc ctgtttgagg actacgtcta 10200 

ctggaccgac tgggaaacaa agtccattaa ccgagcccac aagaccacgg gcaccaacaa 10260 

aacgctcctc atcagcacgc tgcaccggcc catggacctg catgtcttcc atgccctgcg 10320 

ccagccagac gtgcccaatc acccctgcaa ggtcaacaat ggtggctgca gcaacctgtg 103 80 

cctgctgtcc cccgggggag ggcacaaatg tgcctgcccc accaacttct acctgggcag 10440 

cgatgggcgc acctgtgtgt ccaactgcac ggctagccag tttgtatgca agaacgacaa 10500 

gtgcatcccc ttctggtgga agtgtgacac cgaggacgac tgcggggacc actcagacga 10560 

gcccccggac tgccctgagt tcaagtgccg gcccggacag ttccagtgcfc ccacaggtat 10620 

ctgcacaaac cctgccttca tctgcgatgg cgacaatgac tgccaggaca acagtgacga 10680 

ggccaactgt gacatccacg tctgcttgcc cagtcagttc aaatgcacca acaccaaccg 10740 

ctgtattccc ggcatcttcc gctgcaatgg gcaggacaac tgcggagatg gggaggatga 10800 

Sfagggactgc cccgaggtga cctgcgcccc caaccagttc cagtgctcca ttaccaaacg 10860 

gtgcatcccc cgggtctggg tctgcgaccg ggacaatgac tgtgtggatg gcagtgatga 10920 

gcccgccaac tgcacccaga tgacctgtgg tgtggacgag ttccgctgca aggattcggg 10980 

ccgctgcatc ccagcgcgtt ggaagtgtga cggagaggat gactgtgggg atggctcgga 11040 

tgagcccaag gaagagtgtg atgaacgcac ctgtgagcca taccagttcc gctgcaagaa lllOO 

caaccgctgc gtgcccggcc gctggcagtg cgactacgac aacgattgcg gtgacaactc 11160 

cgatgaagag agctgcaccc ctcggccctg ctccgagagt gagttctcct gtgccaacgg 11220 

ccgctgcatc gcggggcgct ggaaatgcga tggagaccac gactgcgcgg acggctcgga 11280 

cgagaaagac tgcacccccc gctgtgacat ggaccagttc cagtgcaaga gcggccactg 11340 

catccccctg cgctggcgct gtgacgcaga cgccgactgc atggacggca gcgacgagga 11400 

ggcctgcggc actggcgtgc ggacctgccc cctggacgag ttccagtgca acaacacctt 11460 

gtgcaagccg ctggcctgga agtgcgatgg cgaggatgac tgtggggaca actcagatga 11520 

gaaccccgag gagtgtgccc ggttcgtgtg ccctcccaac cggcccttcc gttgcaagaa 11580 

tgaccgcgtc tgtctgtgga tcgggcgcca atgcgatggc acggacaact gtggggatgg 11640 

gactgatgaa gaggactgtg agccccccac agcccacacc acccactgca aagacaagaa 11700 

ggagtttctg tgccggaacc agcgctgcct ctcctcctcc ctgcgctgca acatgttcga 11760 

tgactgcggg gacggctctg acgaggagga ctgcagcatc gaccccaagc tgaccagctg 11820 

cgccaccaat gccagcatct gtggggacga ggcacgctgc gtgcgcaccg agaaagcggc 11880 

ctactgtgcc tgccgctcgg gcttccacac cgtgcccggc cagcccggat gccaagacat 11940 

caacgagtgc ctgcgcttcg gcacctgctc ccagctctgc aacaacacca agggcggcca 12000 

cctctgcagc tgcgctcgga acttcatgaa gacgcacaac acctgcaagg ccgaaggctc 12060 

tgagtaccag gtcctgtaca tcgctgatga caatgagatc cgcagcctgt tccccggcca 12120 

cccccattcg gcttacgagc aggcattcca gggtgacgag agtgtccgca ttgatgctat 12180 

ggatgtccat gtcaaggctg gccgtgtcta ttggaccaac tggcacacgg gcaccatctc 12240 

ctaccgcagc ctgccacctg ctgcgcctcc taccacttcc aaccgccacc ggcgacagat 12300 

tgaccggggt gtcacccacc tcaacatttc agggctgaag atgcccagag gcatcgccat 12360 

cgactgggtg gccggaaacg tgtactggac cgactcgggc cgagatgtga ttgaggtggc 12420 

gcagatgaag ggcgagaacc gcaagacgct catctcgggc atgattgacg agccccacgc 12480 

cattgtggtg gacccactga gggggaccat gtactggtca gactggggca accaccccaa 12540 

gattgagacg gcagcgatgg atgggacgct tcgggagaca ctggtgcagg acaacattca 12600 

gtggcccaca ggcctggccg tggattatca caatgagcgg ctgtactggg cagacgccaa 12660 

gctttcagtc atcggcagca tccggctcaa tggcacggac cccattgtgg ctgctgacag 12720 

caaacgaggc ctaagtcacc ccttcagcat cgacgtcttt gaggattaca tctatggtgt 12780 

cacctacatc aataatcgtg tcttcaagat ccataagttt ggccacagcc ccttggtcaa 12840 

cctgacaggg ggcctgagcc acgcctctga cgtggtcctt taccatcagc acaagcagcc 12900 

cgaagtgacc aacccatgtg accgcaagaa atgcgagtgg ctctgcctgc tgagccccag 12960 

tgggcctgtc tgcacctgtc ccaatgggaa gcggctggac aacggcacat gcgtgcctgt 13020 

gccctctcca acgccccccc cagatgctcc ccggcctgga acctgtaacc tgcagtgctt 13 080 

caacggtggc agctgtttcc tcaatgcacg gaggcagccc aagtgccgct gccaaccccg 13140 

ctacacgggt gacaagtgtg aactggacca gtgctgggag cactgtcgca atgggggcac 13200 

ctgtgctgcc tccccctctg gcatgcccac gtgccggtgc cccacgggct tcacgggccc 13260 

caaatgcacc cagcaggtgt gtgcgggcta ctgtgccaac aacagcacct gcactgtcaa 13320 

ccagggcaac cagccccagt gccgatgcct acccggcttc ctgggcgacc gctgccagta 13380 

ccggcagtgc tctggctact gtgagaactt tggcacatgc cagatggctg ctgatggctc 13440 

ccgacaatgc cgctgcactg cctactttga gggatcgagg tgtgaggtga acaagtgcag 13500 

ccgctgtctc gaaggggcct gtgtggtcaa caagcagagt ggggatgtca cctgcaactg 13560 
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cacggatggc cgggtggccc ccagctgtct gacctgcgtc ggccactgca gcaatggcgg . 13620 

ctcctgtacc atgaacagca aaatgatgcc tgagtgccag tgcccacccc acatgacagg 13680 

gccccggtgt gaggagcacg tcttcagcca gcagcagcca ggacatatag cctccatcct 13740 

aatccctctg ctgttgctgc tgctgctggt tctggtggcc ggagtggtat tctggtataa 13800 

gcggcgagtc caaggggcta agggcttcca gcaccaacgg atgaccaacg gggccatgaa 13 860 

cgtggagatt ggaaacccca cctacaagat gtacgaaggc ggagagcctg atgatgtggg 13 920 

aggcctactg gacgctgact ttgccctgga ccctgacaag cccaccaact tcaccaaccc 13980 

cgtgtatgcc acactctaca tggggggcca tggcagtcgc cactccctgg ccagcacgga 14040 

cgagaagcga gaactcctgg gccggggccc tgaggacgag ataggggacc ccttggcata 14100 

gggccctgcc ccgtcggact gcccccagaa agcctcctgc cccctgccgg tgaagtcctt 14160 

cagtgagccc ctccccagcc agcccttccc tggccccgcc ggatgtataa atgtaaaaat 14220 

gaaggaatta cattttatat gtgagcgagc aagccggcaa gcgagcacag tattatttct 14280 

ccatccGctc cctgcctgct ccttggcacc cccatgctgc cttcagggag acaggcaggg 14340 

agggcttggg gctgcacctc ctaccctccc accagaacgc accccactgg gagagctggt 14400 

ggtgcagcct tcccctccct gtataagaca ctttgccaag gctctcccct ctcgccccat 14460 

ccctgcttgc ccgctcccac agcttcctga gggctaattc tgggaaggga gagttctttg 14520 

ctgcccctgt ctggaagacg tggctctggg tgaggtaggc gggaaaggat ggagtgtttt 14580 

agttcttggg ggaggccacc ccaaacccca gccccaactc caggggcacc tatgagatgg 14640 

ccatgctcaa cccccctccc agacaggccc tccctgtctc cagggccccc accgaggttc 14700 

ccagggctgg agacttcctc tggtaaacat tcctccagcc tcccctcccc tggggacgcc 14760 

aaggaggtgg gccacaccca ggaagggaaa gcgggcagcc ccgttttggg gacgtgaacg 14820 

ttttaataat ttttgctgaa ttctttacaa ctaaataaca cagatattct tataaataaa 14880 

attgtaaaaa aaaaaa 14896 



<210> 7 
<211> 126 
<212> PRT 

<213> Homo sapiens 
<400> 7 



He 


Ala 


Leu 


Asp 


Phe 


His 


Leu Ser 


Gin Ser Ala 


Leu 


Tyr 


Trp 


Thr 


Asp 


1 








5 






10 








15 




Val 


Val 


Glu 


Asp 
20 


Lys 


He 


Tyr Arg 


Gly Lys Leu 
25 


Leu 


Asp 


Asn 
30 


Gly 


Ala 


Leu 


Thr 


Ser 
35 


Phe 


Glu 


Val 


Val He 
40 


Gin Tyr Gly 


Leu 


Ala 
45 


Thr 


Pro 


Glu 


Gly 


Leu 
50 


Ala 


Val 


Asp 


Trp 


He Ala 

55 


Gly Asn He 


Tyr 
60 


Trp 


Val 


Glu 


Ser 


J^sn 


Leu 


Asp 


Gin 


He 


Glu 


Val Ala 


Lys Leu Asp 


Gly 


Thr 


Leu 


Arg 


Thr 


65 










70 




75 










80 


Thr 


Leu 


Leu 


Ala 


Gly 
85 


Asp 


He Glu 


His Pro Arg 
90 


Ala 


He 


Ala 


Leu 
95 


Asp 


Pro 


Arg 


Asp 


Gly 
100 


He 


Leu 


Phe Trp 


Thr Asp Trp 
105 


Asp 


Ala 


Ser 
110 


Leu 


Pro 


Arg 


He 


Glu 
115 


Ala 


Ala 


Ser 


Met Ser 
120 


Gly Ala Gly 


Arg 


Arg 
125 


Thr 







<210> 8 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met 

15 10 15 

Lys Val Tlir Gly Glu Gly Cys Val Tyr Leu Qln Thr Ser Leu Lys Tyr 
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20 








25 


30 




Aan 


He 


Leu 


Pro 


Qlu 


Lvfl 


Glu Qlu 


Phe 


Pro Phe Ala Leu Gly 


Val Gin 






35 








40 




45 




Thr 


Leu 


Pro 


Gin 


Thr 


Cys 


AsD Glu 


Pro 


Lva Ala Hj.s TYit SeT* 


Phia nl n 




50 










55 




60 




He 


Ser 


Leu 


Ser 


Val 


Ser 


'^yr Thr 


Qlv 


S&Y Ara Seir Ala fier* 




65 










70 






75 


80 


Ala 


He 


Val 


Asp 


Val 


Lys 


I*1B *m V CI X 




V Tl o Di"^ T.oif 
c^xiCi XJLC Mr^sj ucu 


liyS IrfO 










85 








90 


95 


Thr 


Val 


Lys 


Met 


Leu 


Glu 


Arg Ser 


Asn 


His Val Ser Arg Thr 


Glu Val 








100 








105 


110 




Ser 


Ser 


Asn 


His 


Val 


Leu 


He Tyr 


Leu 


Asp Lys Val Ser Asn 


Gin Thr 






115 








120 




125 




Leu 


Ser 


Leu 


Phe 


Phe 


Thr 


Val Leu 


Gin 


Asp Val Pro Val Arg 


Asp Leu 




130 










135 




140 


Lys 


Pro 


Ala 


He 


Val 


Lys 


Val Tyr 


Asp 







145 150 



<210> 9 

<211> 138 

<212> PRT 

<213> Homo sapiens 





<400> 


9 






Met 


Lys 


Val 


Thr Gly 


Glu Gly Cys 


1 








5 




Tyr 


Asn 


He 


Leu 


Pro 


Glu Lys Glu 








20 






Gin 


Thr 


Leu 


Pro 


Gin 


Thr Cys Asp 






35 






40 


Gin 


He 


Ser 


Leu 


Ser 


Val Ser Tyr 




50 








55 


Met 


Ala 


He 


Val Asp 


Val Lys Met 


65 










70 


Pro 


Thr 


Val 


Lys 


Met 


Leu Glu Arg 










85 




Val 


Ser 


Ser 


Asn 


His 


Val Leu He 








100 






Thr 


Leu 


Ser 


Leu 


Phe 


Phe Thr Val 






115 






120 


Leu 


Lys 


Pro 


Ala 


He 


Val Lys Val 




130 








135 



Val 


Tyr Leu Gin Thr Ser 


Leu Lys 




10 


15 


Glu 


Phe Pro Phe Ala Leu 


Gly Val 


25 


30 




Glu 


Pro Lys Ala His Thr 


Ser Phe 




45 




Thr 


Gly Ser Arg Ser Ala 


Ser Asn 




60 




Val 


Ser Gly Phe He Pro 


Leu Lys 




75 


80 


Ser 


Asn His Val Ser Arg 


Thr Qlu 




90 


95 


Tyr 


Leu Asp Lys Val Ser 


Asn Gin 


105 


110 




Leu 


Gin Asp Val Pro Val 


Arg Asp 




125 




Tyr 


Asp 





<210> 10 

<211> 27 

<212> PRT 

<213> Homo sapiens 



<400> 10 
Ser Val Ser Tyr Thr Gly Ser Arg 

1 5 
Asp Val Lys Met Val Ser Gly Phe 
20 

<210> 11 
<211> 126 
<212> PRT 

<213> Homo sapiens 



Ser Ala Ser Asn Met Ala He Val 

10 15 
He Pro Leu 
25 
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<400> 


11 












Leu 


Gin 


Qln 


Val 


Ser 


Leu 




JJCU IrJbW \9l>U 


THn^ Qoi* T.xra 

xyx sex f*ieu jjyB 


1 








5 






10 


xs 


Val 


Thr 


Gly 


Glu 


Glv 


Cys 


Val Tvr 


r*eii Oln T'h'K fS^T* 

•UCU VXIX .LUX OCX 


TiOii TiVa nVrY" IV a VI 

xieix xiys xyx ash 








20 








25 


30 


lie 


Leu 


Pro 


Glu 


Lvs 


Glu 


Glu Phe 


Pvo Pli^ A.Ta T«^ii 

JTJUW 


urxy veix uj.ll xxix 






35 








40 




45 


Leu 


Pro 


Gin 


Thr 


Cys 


Asp 


Glu Pro 


TtVfl Ala B TVti^ 
xjya txMCL xx^o xxxx 


Qo*^ Dlio /31rk TIa 
OCX irxie Vaxn xxe 




50 










55 






S&T 


Leu 


Ser 


Val 


Ser 






OCXT -nX^ Dcsx i^Xa 


OCX iiBn new AJ.a 


65 










70 




75 


80 


lie 


Val 


Asp 


Val 


Lys 


Met 


Val Ser 


Gly Phe lie Pro 


Leu Lys Pro Thr 










85 






90 


95 


Val 


Lys 


Met 


Leu 


Glu 


Arg 


Ser Asn 


His Val Ser Arg 


Thr Glu Val Ser 








100 








105 


110 


Ser 


Asn 


His 


Val 


Leu 


lie 


Tyr Leu 


Asp Lys Val Ser 


Asn Gin 






115 








120 




125 





<210> 


12 














<211> 


111 
















PRT 














<z 13> 


Homo sapiens 










<400> 


12 












Leu 


vsin Qln 


Val 


Ser 


Leu Pro 


Glu 


Leu Pro Gly Glu 


Tyr Ser Met Lys 


1 






s 






10 


15 


vax 


xnr G±y 


Glu 


Gly 


Cys Val 


Tyr 


Leu Gin Thr Ser 


Leu Lys Tyr Asn 






20 








25 


30 


ixe 


Leu Pro 


Glu 


Lys 


Glu Glu 


Phe 


Pro Phe Ala Leu 


Gly Val Gin Thr 




o c 
JO 








40 




45 


Leu 


Pro Gin 


Thr 


Cys 


Asp Glu 


Pro 


Lys Ala His Thr 


Ser Phe Gin Xle 




50 






55 




60 




Ser 


Leu Ser 


Val 


Ser 


Tyr Thr Gly 


Ser Arg Ser Ala 


Ser Asn Met Ala 
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70 




75 


80 


He 


Val Asp 


Val 


Lys 


Met Val 


Ser 


Gly Phe He Pro 


Leu Lys Pro Thr 








85 






90 


95 


Val 


Lys Met 


Leu 


Glu 


Arg Ser Asn 


His Val Ser Arg 


Thr Glu Val 






100 








105 


110 




<210> 


13 














<211> 
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<212> 


PRT 
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Homo sapiens 










<400> 


13 












Leu 


Gin Gin 


Val 


Ser 


Leu Pro 


Glu 


Leu Pro Gly Glu 


Tyr Ser Met Lys 


1 
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15 


Val 
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Glu 
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Leu Gin Thr Ser 


Leu Lys Tyr Asn 
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30 


He 


Leu Pro 


Glu 


Lys 


Glu Glu 
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Leu 


Pro Gin 


Thr 


Cys Asp Glu 


Pro 


Lys Ala His Thr 
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50 






55 




60 




Ser 
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Ser Tyr Thr Gly 


Ser Arg Ser Ala 
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75 
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He 
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<210> 14 

<211> 101 

<212> FRT 

<213> Homo sapiens 





<400> 


14 
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Lvs 


Tyr Asn He Leu Pro Glu Lys 


Glu Glu Phe 


Pro 
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Phe 
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Val 
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Asp Glu Pro 
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30 
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35 






40 


45 




Arg 
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Ser 
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Gly 




50 
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Phe 


He Pro 


Leu 


Lys 


Pro Thr Val Lys Met Leu Glu 


Arg Ser Asn 
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Val 
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Thr 


Glu 
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